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Description 

This invention relates to the field of recombinant DNA biotechnology. In one of its aspects, the invention 
relates to DNA fragments which regulate the transcription of DNA into messenger RNA, and the initiation 
5 and termination of the translation of messenger RNA into protein. In another aspect, the invention relates to 
expression vectors which incorporate the above-described DNA fragments. In yet another aspect, the 
invention relates to novel microorganisms transformed with the above-described expression vectors. In a 
further aspect, the invention relates to the production of polypeptides. 

io Background 

As recombinant DNA technology has developed in recent years, the controlled production by microor- 
ganisms of an enormous variety of useful polypeptides has become possible. Many eukaryotic polypep- 
tides, such as for example human growth hormone, leukocyte interferons, human insulin and human 
is proinsulin have already been produced by various microorganisms. The continued application of techniques 
already in hand is expected in the future to permit production by microorganisms of a variety of other useful 
polypeptide products. 

The basic techniques employed in the field of recombinant DNA technology are known by those of skill 
in the art. The elements desirably present in order for a host microorganism to be useful for the practice of 
20 recombinant DNA technology include, but are not limited to: 

(1) a gene encoding one or more desired polypeptide(s) and provided with adequate control sequences 
required for expression in the host microorganism, 

(2) a vector, usually a plasmid, into which the gene can be inserted. The vector serves to guarantee 
transfer of the gene into the cell and maintenance of DNA sequences in the cell as well as a high level of 

25 expression of the above-mentioned gene, and 

(3) a suitable host microorganism into which the vector carrying the desired gene can be transformed, 
where the host microorganism also has the cellular apparatus to allow expression of the information 
coded for by the inserted gene. 

A basic element employed in recombinant DNA technology is the plasmid, which is extrachromosomai, 

30 double-stranded DNA found in some microorganisms. Where plasmids have been found to naturally occur 
in microorganisms, they are often found to occur in multiple copies per cell. In addition to naturally 
occurring plasmids, a variety of man-made plasmids, or hybrid vectors, have been prepared. Included in the 
information encoded in plasmid DNA is that required to reproduce the plasmid in daughter cells, i.e., an 
autonomously replicating sequence or an origin of replication. One or more phenotypic selection char- 

35 acteristics must also be included in the information encoded in the plasmid DNA. The phenotypic selection 
characteristics permit clones of the host cell containing the plasmid of interest to be recognized and 
selected by preferential growth of the cells in selective media. 

The utility of plasmids lies in the fact that they can be specifically cleaved by one or another restriction 
endonuclease or restriction enzyme, each of which recognizes a specific, unique site on the plasmid DNA. 

40 Thereafter, homologous genes, heterologous genes, i.e., genes derived from organisms other than the host, 
or gene fragments may be inserted into the plasmid by endwise joining of the cleaved plasmid and desired 
genetic material at the cleavage site or at reconstructed ends adjacent to the cleavage site. The resulting 
recombined DNA material can be referred to as a hybrid vector. 

DNA recombination is performed outside the host microorganism. The resulting hybrid vector can be 

45 introduced into the host microorganism by a process known as transformation. By growing the transformed 
microorganism, large quantities of the hybrid vector can be obtained. When the gene is properly inserted 
with reference to the portions of the plasmid which govern transcription and translation of the encoded DNA 
message, the resulting hybrid vector can be used to direct the production of the polypeptide sequence for 
which the inserted gene codes. The production of polypeptide in this fashion is referred to as gene 

so expression. 

Gene expression is initiated in a DNA region known-as the promoter region. In the transcription phase of 
expression, the DNA unwinds exposing it as a template for synthesis of messenger RNA. RNA polymerase 
binds to the promoter region and travels along the unwound DNA from its 3' end to its 5* end, transcribing 
the information contained in the coding strand into messenger RNA (mRNA) from the 5* end to the 3* end of 
55 the mRNA. The mess nger RNA is, in turn, bound by ribosomes, wher the mRNA is translat d into the 
polypeptide chain. Each amino acid is encoded by a nucleotide triplet or codon within what may be r ferred 
to as the structural gene, i. ., that part of the gene which ncodes the amino acid sequenc of the 
expressed product. Since thr e nucleotides code for th production of each amino acid, it is possible for a 
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nucleotide sequence to be "read" in three different ways. The specific reading fram which encodes the 
desired polypeptide product is referred to as the proper reading frame. 

After binding to the promoter, RNA polymerase first transcribes a 5' leader region of mRNA, then a 
translation initiation or start codon, followed by the nucleotide codons within the structural gene itself. In 

5 order to obtain the desired gene product, it is necessary for the initiation or start codon to correctly initiate 
the translation of messenger RNA by the ribosome in the proper reading frame. Finally, stop codons are 
transcribed at the end of the structural gene which cause any additional sequences of mRNA to remain 
untranslated into peptide by the ribosomes, even though additional sequences of mRNA had been formed 
by the interaction of RNA polymerase with the DNA template. Thus, stop codons determine the end of 

io translation and therefore the end of further incorporation of amino acids into the polypeptide product. The 
polypeptide product can be obtained by lysing the host cell and recovering the product by appropriate 
purification from other microbial protein, or, in certain circumstances, by purification of the fermentation 
medium in which the host cells have been grown and into which the polypeptide product has been 
secreted. 

75 In practice, the use of recombinant DNA technology can create microorganisms capable of expressing 
entirely heterologous polypeptides, i.e., polypeptides not ordinarily found in, or produced by, a given 
microorganism — so called direct expression. Alternatively, there may be expressed a fusion protein, i.e., a 
heterologous polypeptide fused to a portion of the amino acid sequence of a homologous polypeptide, i.e., 
polypeptides found in. or produced by, the wild-type (non-transformed) host microorganism — so called 

20 indirect expression. With indirect expression, the initially obtained fusion protein product is sometimes 
rendered inactive for its intended use until the fused homologous/heterotogous polypeptide is cleaved in an 
extracellular environment. Thus, for example, cyanogen bromide cleavage of methionine residues has 
yielded somatostatin, thymosin alpha 1 and the component A and B chains of human insulin from fused 
homologous/heterologous polypeptides, while enzymatic cleavage of defined residues has yielded beta 

25 endorphin. 

Up to now, commercial efforts employing recombinant DNA technology for producing various polypep- 
tides have centered on Escherichia coti as a host organism. However, in some situations E. cofi may prove 
to be unsuitable as a host. For example, E. coti contains a number of toxic pyrogenic factors that must be 
eliminated from any polypeptide useful as a pharmaceutical product. The efficiency with which this 

30 purification can be achieved will, of course, vary with the particular polypeptide. In addition, the proteolytic 
activities of E, coti can seriously limit yields of some useful products. These and other considerations have 
led to increased interest in alternative hosts, in particular, the use of eukaryotic organisms for the production 
of polypeptide products is appealing. 

The availability of means for the production of polypeptide products in eukaryotic systems, e.g., yeast, 

35 could provide significant advantages relative to the use of prokaryotic systems such as E. coti for the 
production of polypeptides encoded by recombinant DNA. Yeast has been employed in large scale 
fermentations for centuries, as compared to the relatively recent advent of large scale E. coli fermentations. 
Yeast can generally be grown to higher cell densities than bacteria and are readily adaptable to continuous 
fermentation processing. In fact, growth of yeast such as Pichia pastoris to ultra-high cell densities, i.e., cell 

40 densities in excess of 100 g/L, is disclosed by Wegner in U.S. 4,414,329 (assigned to Phillips Petroleum 
Co.). Additional advantages of yeast hosts include the fact that many critical functions of the organism, e.g., 
oxidative phosphorylation, are located within organelles, and hence not exposed to the possible deleterious 
effects of the organism's production of polypeptides foreign to the wild-type host cells. As a eukaryotic 
organism, yeast may prove capable of glycosylating expressed polypeptide products where such glycosyla- 

45 tion is important to the bioactivity of the polypeptide product. It is also possible that as a eukaryotic 
organism, yeast will exhibit the same codon preferences as higher organisms, thus tending toward more 
efficient production of expression products from mammalian genes or from complementary DNA (cDNA) 
obtained by reverse transcription from, for example, mammalian mRNA. 

The development of poorly characterized yeast species as hosWector systems is severely hampered 

so by the lack of knowledge about transformation conditions and suitable vectors. In addition, auxotrophic 
mutations are often not available, precluding a direct selection for transformants by auxotrophic complemen- 
tation. If recombinant DNA technology is to fully sustain its promise, new host/vector systems must be 
devised which facilitate the manipulation of DNA as well as optimize expression of inserted DNA sequences 
so that the desired polypeptide products can be prepared under controlled conditions and in high yield. 

55 

Obj cts of the Invention 

An object of our invention is therefor a novel r gulatory r gion responsrv to the pr sence of methanol. 
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A further object of the invention is a novel catabolite s nsitiv r gulatory region which is responsiv to 
the presence of some carbon sources but which is not responsive to the presence of other carbon sources. 
Another object of the invention is a novel regulatory region responsive to carbon source starvation. 
Yet another object of our invention is novel vectors capable of expressing an inserted polypeptide 
5 coding sequence. 

Still another object of our invention is novel yeast strain of the genus Pichisu 

A further object of our invention is a process for producing polypeptides employing the novel yeast 
strain as described hereinabove. 

These and other objects of our invention will become apparent from the disclosure and claims herein 
io provided. 

Statement of the Invention 

In accordance with the present invention, we have discovered, isolated and characterized DNA 
is sequences which control the transcription of DNA into messenger RNA and translation of the messenger 
RNA to give a polypeptide product. The novel DNA sequences of this invention are useful for the production 
of polypeptide products by (a) yeast strains which are capable of growth on methanol, as a carbon and 
energy source, (b) yeast strains which are capable of growth on glucose, ethanol, fructose and the like; and 
(c) yeast strains which are capable of growth on glycerol, galactose, acetate and the like. 

20 

Brief Description of the Figures 

Figure 1 is a correlation of the relationship between the genomic clone (pPG 6.0) and cDNA clone (pPC 
1 5.0) for protein p76. 

25 Figure 2 is a correlation of the relationship between the genomic clone (pPG 4.0) and cDNA clones 
(pPC 8.3 and pPC 8.0) for protein p72 (alcohol oxidase). 

Figure 3 is a correlation of the relationship between the genomic clone (pPG 4.8) and cDNA clone (pPC 
6.7) for protein p40. 

Figure 4 provides restriction maps of regulatory regions of the invention from clone pPG 6.0. 
30 Figure 5 is a restriction map of the regulatory region of the invention from clone pPG 4.0. 
Figure 6 is a restriction map of the regulatory region of the invention from clone pPG 4.8. 
Figure 7 is a restriction map of a sequence of DNA obtained from the 3* end of the p76 structural gene. 
Figure 8 provides restriction maps of sequences of DNA obtained from the 3* end of the p72 (alcohol 
oxidase) structural gene. 

35 Figure 9 is a restriction map of a sequence of DNA obtained from the 3* end of the p40 structural gene. 

Figure 10 is a restriction map of the protein p76 structural gene and the 5* regulatory region therefor. 

Figure 1 1 is a restriction map of the protein p40 structural gene and the 5' regulatory region therefor. 

Figure 12 is a restriction map of the protein p76 cDNA. 

Figure 1 3 is a restriction map of the protein p72 (alcohol oxidase) cDNA. 
ao Figure 14 is a restriction map of the protein p40 cDNA. 

Figure 15 provides restriction maps of two novel p76 regulatory region-/acZ DNA constructs of the 
invention. 

Figure 16 is a restriction map of a novel p72 (alcohol oxidase) regulatory region-SacZ DNA construct of 
the invention. 

45 Figure 17 is a restriction map of plasmid pSAOH 1. 

Figure 18 is a restriction map of plasmid pSAOH 5. 

Figure 19 is a restriction map of plasmid pSAOH 10. 

Figure 20 is a restriction map of plasmid pTAFH.85. 

Figure 21 is a restriction map of plasmid pT76H 1 . 
so Figure 22 is a restriction map of plasmid pT76H 2. 

Figure 22a is a restriction map of plasmid pT76H3. 

Figure 22b is a restriction map of plasmid pT76H4. 

Figure 23 is a restriction map of plasmid pYA2. 

Figure 24 is a restriction map of plasmid pYA4. 
55 Figure 25 is a r striction map of plasmid pYJ8. 

Figure 26 is a r striction map of plasmid pYJSA C/a. 

Figur 27 is a restriction map of plasmid pYJ30. 

Figure 28 provides a r striction map of plasmid pTAFH 1 and shows how th plasmid was derived. 



5 



EP 0 183 071 B1 



Figure 29 provides a restriction map of pi asm id pTAO 12 and shows how the plasmid was derived. 
Figure 30 is a restriction map of plasmid pTAOl 3. 
Figure 30a is a restriction map of plasmid pT76U1. 

Figure 31 provides a restriction map of plasmid pTAOl and shows how the plasmid was derived. 
5 Figure 32 provides a restriction map of plasmid pTAF.85 and shows how the plasmid was derived. 

Figure 33 provides a restriction map of plasmid YEp1 3. 
Figure 34 is a restriction map of pBPfl . 

The following abbreviations are used throughout this application to represent the restriction enzymes 





employed: 






TO 


A 


= 


AsuU 




B 




BamHl 




Ba 


= 


eg/11 




Be 


= 


Bcl\ 




C 


s 


C/al 


75 


H 2 


= 


HtncU 




H 3 


= 


HindlU 




K 


= 


Kpn\ 




Ndi 




Nde\ 




Nr 




NriA 


20 


Ps 




P$t\ 




Pvi 




Pvu\ 




Pv 2 




PvuU 




Ri 




EcoRI 




Rs 




EcoRV 


25 


Rs 




Rsa\ 




S 




Sal\ 




s 3 




Sau3A\ 




Sc 




Sac\ 




Sm 




Sma\ 


30 


Sp 




$ph\ 




Ss 




Sst\ 




St 




SttA 




T 




Taq\ 




Th 




Thai 


35 


Xb 




Xbal 




Xh 




XhcA 




Xm 




Xmai 



In the attached figures, restriction sites employed for manipulation of DNA fragments, but which are 
destroyed upon ligation are indicated by enclosing the abbreviation for the destroyed site in parentheses. 

40 

Detailed Description of the Invention 



In accordance with the present invention, there is provided a novel DNA fragment comprising a 
regulatory region responsive to at least one of the following conditions: the presence of methanol, carbon 

45 source starvation when cells are grown on some substrates other than methanol, and the presence of non- 
catabolite repressing carbon sources other than methanol. The regulatory region of the DNA fragment of 
this invention is capable of controlling the transcription of messenger RNA when positioned at the 5* end of 
the DNA which codes for the production of messenger RNA. Also included within the scope of our invention 
are mutants of the above-described DNA fragment. 

so Further in accordance with the present invention, there is provided a DNA fragment which comprises a 
regulatory region which is capable of controlling the polyadenylation, termination of transcription and 
termination of translation of messenger RNA when positioned at the 3' end of the polypeptide coding region 
which codes for the production of messenger RNA, wherein the transcription and translation of the 
messenger RNA is controlled by a regulatory region which is responsive to at least one of the following 

ss conditions: the presenc of m thanol, carbon source starvation when cells are grown on some substrates 
other than methanol and the presence of non-catabolite repressing carbon sources oth r than methanol. 
Also included within th scope of our inv ntion ar mutants of the above-described DNA fragment. 

Still further in accordance with a specific embodiment of the inv ntion, th re are provid d DNA 
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fragments which direct th incorporation of encoded polypeptide into peroxisomes. Peroxisomes are 
intracellular bodies present in large amounts in methanol grown yeast cells. These intracellular bodies serve 
to isolate the incorporated polypeptide product from intracellular fluids and enzymes such as proteases. 

In accordance with another embodiment of the invention, genes coding for the production of alcohol 
s oxidase, a protein of about 40 kilodaltons and a protein of about 76 kilodaltons are provided. 

In accordance with yet another embodiment of the present invention, plasmids and transformed 
organisms containing the above-described DNA fragments are provided. 

In accordance with still another embodiment of the invention, methods are provided for producing the 
plasmids and DNA fragments of the invention, as well as heterologous polypeptides, i.e., polypeptides not 
io native to the host organisms. 

Isolation of Regulatable Genes from Pichia pas tons 

An approximately 20,000 member cDNA library was prepared in E. coif with poly A+ RNA isolated 
rs from Pichia pastoris cells grown on methanol as the sole carbon source (See Example III). The library was 
screened by hybridization using kinased poly A+ RNA isolated from Pichia pastoris grown either in the 
presence of methanol or ethanol. After several rounds of this plus-minus screening, three distinct, non- 
homologous cDNA clones were identified as being copies of methanol specific messenger RNA's. These 
clones were designated as pPC 6.4, pPC 8.0, and pPC 15.0 and were determined to contain inserts of 470, 
20 750 and 1 1 00 nucleotides in length, respectively. 

In an attempt to obtain cDNA clones of longer length, a second cDNA library was prepared using milder 
S1 nuclease digestion conditions than used for the preparation of the first cDNA library and the members of 
this new library screened individually with ^P-labeled cDNA clones pPC 6.4, pPC 8.0, and pPC 1 5.0. As a 
result, larger cDNA clones were isolated corresponding to cDNA clones pPC 6.4 and pPC 8.0. The larger 
25 clones, pPC 6.7 and pPC 8.3, were found to contain inserts measuring 1200 and 2100 nucleotides, 
respectively (See Figures 2 and 3). A cDNA clone possessing an insert larger than the 1100 nucleotides for 
pPC 15.0 has not been observed after screening more than 40,000 cDNA clones. 

The isolation of the genomic DNA fragments corresponding to each of these cDNA clones was 
accomplished by first cutting out and electroeluting from agarose gels Pichia pastoris DNA fragments of 
30 restriction endonuclease treated chromosomal DNA that hybridized with ^P-labeled pPC 15.0, pPC 8.0, or 
pPC 6.4. Then the eluted genomic DNA fragments were cloned into Escherichia coli and the appropriate 
genomic clones identified by screening several times with each of the above cDNA probes. 

The relationship of each cDNA clone to its corresponding genomic clone is illustrated in Figures 1, 2, 
and 3. pPC 15.0 is encoded within a 6000 nucleotide H/ndlll genomic fragment present in clone pPG 6.0 
35 (Figure 1). The 5* end of the gene encoded by pPC 15.0 is oriented toward the 1300 bp Mndlll-FcoRI 
fragment contained in pPG 6.0, while the 3* end of the gene is proximal to the Pst\ sites in pPG 6.0. 

The cDNA clone pPC 8.3 is included within the genomic clone pPG 4.0 (Figure 2). pPG 4.0 contains an 
EcoR\-PvU\\ insert of 4000 nucleotides of contiguous genomic DNA. The orientation of pPC 8.3 within pPG 
4.0 places the 5' end of the gene for this cDNA clone close to the Ba/nHI sites while the 3* end of this gene 
40 is located near the PviA\ site. The orientation of pPC 8.0 (a related cDNA clone) within pPG 4.0 places the 
5* end of this cDNA clone close to the Kpn\ site at the 3* end of pPG 4.0 and the 3' end of the cDNA clone 
is located near the PvtAX site. 

The cDNA clone pPC 6.7 is located entirely within a 4800 nucleotide EcoR\-BamH\ genomic fragment 
(Figure 3). Clone pPC 8.4 is in turn located completely within cDNA clone pPC 6.7. Since pPC 6.7 was a 
45 more complete copy than pPC 6.4, the latter was not investigated further. The 5* end of the gene is 
positioned closer to the flamHI end than to the EcoRI end of the genomic clone pPG 4.8 (Figure 3). 

In all of these above-described genomic clones, there are at least 1.2 kitobase pairs of flanking genomic 
DNA sequence which are 5* to the structural genes copied in each of the cDNA clones. 

Each of the genomic and cDNA clones described above have been deposited with the Northern 
so Regional Research Center of the United States of America, Peoria, Illinois. 
All clones have been deposited in E. coil hosts: 



55 
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Plasmid 


Host 


Accession No. 


pPG 6.0 
pPG 4.0 
pPG 4.8 
pPC 15.0 
pPC 8.3 
pPC 6.7 
pPC 8.0 


E. coli LE392-pPG 6.0 
E. coli LE392-pPG 4.0 
E. coli LE392-pPG 4.8 
E. coll LE392-pPC 15.0 
E. CO// LE392-pPC 8.3 
E. CO// LE392-pPC 6.7 
E. CO/7 MM294-pPC 8.0 


NRRL B-15867 
NRRL B-15868 
NRRL B-15869 
NRRL B-15870 
NRRL B-15871 
NRRL B-15872 
NRRL B-15873 



All of the above organisms have been irrevocably deposited and made available to the public as of August 
31. 1984. 

Uniqueness of pPG 6.0, pPG 4.0 and pPG 4.8 to Methanol Assimilating Yeasts 

Each of the cDNA clones described above have been labeled and employed as probes of chromosomal 
DNA sequences from a number of methanol assimilating yeasts and a methanol non-assimilating yeast. 
Homologous genes for all three of the cDNAs were found to exist in essentially all methanol assimilating 
y easts, but were clearly not present in methanol non-assimilating yeast (S. cerevisiae). It is thus believed 
that these genes are unique to methanol assimilating yeast. In addition, the southern hybridization 
experiments detailed in Example XVII demonstrate that a high degree of homology exists between these 
unique methanol responsive genes from various methanol assimilating yeasts. 

Characterization of the RNA Transcripts of the pPG 6.0, pPG 4.0 and pPG 4.8 Genes 

The influence of methanol on the expression of each of these cloned genes can be observed by 
studying the effects on transcription of these genes. Isolated poly A+ RNA from Pichia pas ton's cells 
grown with ethanol or methanol as sole carbon source was used to prepare Northern hybridization filters 
(See Example IV)- Three identical pairs of filters from methanol and ethanol grown cells (See Example I) 
were probed separately with ^P-labeled pPC 15.0, pPC 8.0 and pPC 6.4. The clones pPC 15.0, pPC 8.0, 
and pPC 6.4 hybridized to RNA molecules (of approximately 2400, 2300, and 1300 nucleotides, respec- 
tively) from methanol grown cells. No hybridization of clones pPC 15.0 and pPC 8.0 with the hybridization 
probes was observed with RNA obtained from cells grown in the presence of ethanol. However, when RNA 
isolated from celts grown on ethanol was probed with pPC 6.4, the clone hybridized to a 1 300-nucleotide 
RNA molecule identical to that seen with methanol-grown cells but at an estimated (qualitatively) 5-fold 
lower level. 

Size Determination of Protein Products Encoded by pPG 6.0. pPG 4.0 and pPG 4.8 

To determine what protein products were encoded by each of the above-identified cDNA clones, poly 
A + RNA from Pichia pastoris cells grown on methanol was selectively hybridized to each of the cDNA 
clones. The hybrid-selected mRNA, i.e., mRNA which hybridized to each of the cDNA clones, was then 
translated In vitro and each of the protein products resolved by electrophoresis using SDS-denaturing 
conditions (See Example V)- The results of these in vitro positive hybridization-translation experiments 
indicated that clones pPC 15.0, pPC 8.3, and pPC 6.7 select mRNAs which encode information for 
polypeptides of 76,000 (p76), 72,000 (p72) and 40,000 (p40) daltons, respectively. These same proteins are 
observed when total poly A + RNA (i.e., not hybrid-selected) from methanol grown Pichia pastoris cells is 
translated in the same in vitro system. 

I dentification of p72 as Alcohol Oxidase 

A. Molecular Weight Comparison 

A sample highly enriched for alcohol oxidase protein was prepared by dialysis of cleared cell lysates 
against H 2 0 (S Example VII). The crystalline pr cipitate resulting from this dialysis was shown by SDS 
lectrophor sis to contain predominantly two polypeptides of 76,000 and 72,000 daltons, respectiv ly. The 
precipitate was subjected to additional purification by chromotography through Sephacryl 200 (See Example 
VII), which demonstrated that alcohol oxidase activity corr spond d to the activity of the purified 72,000 
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dalton polypeptide. The size of this polypeptid was id ntical to that of the protein product selected by 
cDNA clone pPC 8.3 (See Example X). 

B. Immunoprecipitation 

5 

Additional support that clones pPC 8.3 and pPG 4.0 encode the alcohol oxidase structural gene was 
obtained by means of an immunological approach (Example XI). The protein preparation isolated from 
Pichia pastoris containing both the 76,000 and 72.000 dalton polypeptides was used to raise specific 
antisera for these polypeptides in rabbits. When the hybrid-selected poly A + RNA from clone pPC 8.3 was 
io translated in vitro, only the 72,000 dalton translation product was precipitated by the antisera made against 
the protein preparation from Pichfa pastoris ceils. 

C. Predicted/Actual Amino Acid Sequence Comparison 

is To further verify that clone pPC 8.3 is in fact the cDNA clone encoding Pichia pastoris alcohol oxidase, 
the amino acid sequence for the amino terminal end of the protein was compared with the predicted amino 
acid sequence encoded by pPC 8.3. Thus, the NH 2 -terminal amino acid sequence (Sequence A) of the 
isolated 72,000 dalton protein was determined (Example VIII) to be: 

20 

Ala-Ile-Pro-Glu-Glu-Phe-Asp-Ile-Leu-Val-Leu-Gly-Gly-Gly-Ser- 
Ser-Gly-Ser. 

Sequence A 

25 

In parallel, the nucleotide sequence of the 5' end of the gene encoded in pPC 8.3 and pPG 4.0 was 
determined. The predicted amino acid sequence for amino acids 2-19 (See Sequence B) derived from the 
DNA sequences of both the genomic and cDNA clones agreed perfectly with the first 18 amino acids of the 
30 above determined amino acid sequence (Sequence A) for isolated Pichia pastoris alcohol oxidase: 



Predicted amino acid 



sequence: 




Met 


ala 


iie 


pro 


glu 


glu 


phe 




Nucleotide sequence 


5' 


-ATG 


GCT 


ATC 


CCC 


GAA 


GAG 


TTT 




(pPC 8.3 and pPG 4. 


0) :3' 


-TAC 


CGA 


TAG 


GGG 


CTT 


CTC 


AAA 




asp He leu val 


leu 


gly 


gly 


gly 


ser 


ser 


gly 


ser 




GAT ATC CTA GTT 


CTA 


GGT 


GGT 


GGA 


TCC 


AGT 


GGA 


TCC- 


y 


CTA TAG GAT CAA 


GAT 


CCA 


CCA 


CCT 


AGG 


TCA 


CCT 


AGG- 


y 



Sequence B 

In addition, the entire nucleotide sequence for the coding region of the alcohol oxidase gene was 
determined. The nucleotide sequence determined and the predicted amino acid sequence are set forth in 
60 Sequence B* and are believed to be: 
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Predicted amino acid 





sequence : 








Met 


ai a 


iie 


pro 


gri \2 


giu 


phe 


5 


Nucleotide sequence 
(pPC 8.3 and pPG 4. 


5' 


-ATG 

-TAP 


GCT 


ATC 

TAH 


CCC 


GAA 

PTT 


GAG 

PTP 


TTT 

AAA 




asp 


iie 


leu 


val 


ieu 


gig 


gig 


gig 


ser 


ser 


gig 


ser 


70 


GAT 
CTA 


ATC 
TAG 


CTA 
GAT 


GTT 
CAA 


CTA 

HAT 
I? AX 


GGT 

PPA 


GGT 

PPA 
LA 


GGA 

PPT 


TTC 

APl'I 
A\jO 


AGT 

TP A 


GGA 

PPT 


TCC 
apt: 




cys 


iie 


ser 


gly 


arg 


leu 


ala 


asn 


ieu 


asp 


his 


ser 


15 


TGT 
ACA 


ATT 
TAA 


TCC 
AGG 


GGA 
CCT 


AGA 

TPT 


TTG 

AAP 


GCA 

PPT 


AAC 


TTG 

AAP 


GAC 

PTP 
l—X Vj 


CAC 
ptp 


TCC 




leu 


lys 


val 


gly 


ieu 


iie 


giu 


ala 


gig 


giu 


asn 


gin 


20 


TTG 
AAC 


AAA 
TTT 


GTT 
CAA 


GGT 
CCA 


CTT 
pa a 

l*AA 


ATC 

TAP 


GAA 

PTT 


GCA 


GGT 

LLA 


GAG 


AAC 
TTG 


CAA 
GTT 


pro 


gin 


gin 


pro 


met 


gig 


ieu 


pro 


ser 


arg 


tyr 


ieu 




CCT 
GGA 


CAA 
GTT 


CAA 
GTT 


CCC 
GGG 


ATG 

TAP 


GGT 

PPA 
UUA 


CTA 

PAT 
IvAX 


CCT 

PPA 


TCC 

APP 


AGG 

TPP 
ILL 


TAT 

ATA 


TTA 

AAT 
nnl 


25 


pro 


lys 


lys 


gin 


lys 


ieu 


asp 


ser 


lys 


tAr 


aia 


ser 




CCC 
GGG 


AAG 
TTC 


AAA 
TTT 


CAG 
GTC 


AAG 


TTG 

AAP 
AAC 


GAC 

Liu 


TCC 

AOO 


AAG 

X XL 


ACT 

TP A 


GCT 

PPA 
tun 


TCC 

APP 


30 


phe 


tyr 


thr 


ser 


asn 


pro 


ser 


pro 


his 


ieu 


asn 


gig 


TTC 
AAG 


TAC 
ATG 


ACT 
TGA 


TCT 
AGA 


AAC 

TTP 
X X lj 


CCA 


TCT 

Ar* a 
al»a 


CCT 

f^r* a 
wn 


CAC 

PTP 


TTG 

AAP 


AAT 

TTA 
X In 


GGT 

PPA 




arg 


arg 


ala 


iie 


val 


pro 


cys 


aia 


asn 


val 


ieu 


giy 


35 


AGA 
TCT 


AGA 
TCT 


GCC 
CGG 


ATC 
TAG 


GTT 
CAA 


CCA 
GGT 


TGT 
ACA 


GCT 
CGA 


AAC 
TTG 


GTC 
CAG 


TTG 
AAC 


GGT 
CCA 




gly 


gly 


ser 


ser 


iie 


asn 


phe 


met 


met 


tyr 


tAr 


arg 


40 


GGT 
CCA 


GGT 
CCA 


TCT 
AGA 


TCT 
AGA 


ATC 
TAG 


AAC 
TTG 


TTC 
AAG 


ATG 
TAC 


ATG 
TAC 


TAC 
ATG 


ACC 
TGG 


AGA 
TCT 




gly 


ser 


ala 


ser 


asp 


ser 


asp 


asp 




gin 


aia 


giu 




GGT 
CCA 


TCT 
AGA 


GCT 
CGA 


TCT 
AGA 


GAT 
CTA 


TCT 
AGA 


GAT 
CTA 


GAC 
CTG 


TTN 
AAN 


CAA 
GTT 


GCC 
CGG 


GAG 
CTC 


45 


gly 


ser 


lys 


thr 


giu 


asp 


ieu 


ieu 


pro 


ieu 


met 


lys 




GGC 
CCG 


TCG 
AGC 


AAA 
TTT 


ACA 
TGT 


GAG 
CTC 


GAC 
CTG 


TTG 
AAC 


CTT 
GAA 


CCA 
GGT 


TTG 
AAC 


ATG 
TAC 


AAA 
TTT 



50 



55 
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lys 


thr 


glu 


tAr 


tyr 


gin 


arg 


aia 


7 


gin 


7 


tgr 




AAG 

XXV. 


ACT 
TGA 


GAG 
CTC 


ACC 
TGG 


TAC 
ATG 


CAA 
GTT 


AGA 

TCT 


GCT 


TGN 
ACN 


CAA 
GTT 


CNA 

CNT 
un x 


TAC 

nlu 


5 


pro 


asp 


iie 


iiis 


gly 


phe 


giu 


gig 


pro 


iie 


igs 


vai 




CCT 

PIPIA 


GAC 
rTr 


ATT 
TAA 


CAC 


GGT 
CCA 


TTC 

Air 


GAA 
CTT 


GGT 


CCA 

L»w X 


ATC 
TAG 


AAG 
TTC 


GTT 


10 


ser 


phe 


git/ 


asn 


tyr 


tAr 


tyr 


pro 


vai 


cgs 


gi/j 


asp 


TCT 

APIA 


TTC 

A HP 


GGT 

PP* A 


AAC 

J. lu 


TAC 

ATP 1 
/i X Li 


ACC 

X uO 


TAC 


CCA 


GTT 

p* a & 


TGC 

ACL? 


CAG 

Pi TP 


GAC 
CxG 




phe 


ieu 


arg 


aia 


ser 


giu 


ser 


giii 


gig 


iie 


pro 


tgr 


15 


TTC 


TTG 

AAL> 


AGG 

ILL. 


GCT 


TCT 

AL?A 


GAG 

Lit 


TCC 


CAA 

GTT 


GGT 

CCA 


ATT 

TAA 


CCA 

CGT 


TAC 
ATG 




vai 


asp 


asp 


ieu 


giu 


asp 


ieu 


vai 


ieu 


thr 


his 


gig 




GTT 
CAA 


GAC 


GAT 

CIA 


CTG 

UAL. 


GAA 
CTT 


GAC 
CTG 


TTG 
AAC 


GTA 

CAT 


CTG 
GAC 


ACT 
TGA 


CAC 
GTG 


GGT 
CCA 


20 


aia 


glu 


his 


trp 


ieu 


iys 


trp 


iie 


asn 


arg 


asp 


tAr 




GCT 
p»p* a 


GAA 

L X J. 


CAC 


TGG 

r\LL 


TTG 

nnL 


AAG 
TTC 


TGG 

ACC 


ATC 


AAC 

x xL» 


AGA 

JLCx 


GAC 

CxL* 


ACT 

TP* A 
XL* A 




gly 


arg 


arg 


ser 


asp 


ser 


aia 


his 


aia 


phe 


vai 


his 


25 


CGT 

P'P' A 


CGT 


TCC 


GAC 

P"TT~ 
I. J. Lj 


TCT 
ap* a 


GCT 

P*P* A 


CAT 

bXA 


GCA 


TTT 

AAA 

AAA 


GTC 

pur 
UAu 


CAC 
GTG 


TCT 

ACA 




ser 


tAr 


met 


arg 


asn 


his 


asp 


as/3 


ieu 




ieu 


iie 


30 


TCT 

AHA 


ACT 
TnA 


ATG 

TAP 


AGA 

TP*T 
X l> X 


AAC 

TTPi 
X Xw 


CAC 

PiTPi 

IrX w 


GAC 


AAC 
TTG 


TTG 

A AP* 


TAC 

A TP* 


TTG 
a ar 


ATC 

TAP* 




cys 


as/3 


thr 




vai 


asp 


iys 


iie 


iie 


vai 


giu 


asp 




TGT 
ACA 


AAC 

TTG 


ACG 
TGC 


AAG 

TTP 


GTC 
CAG 


GAC 

PTC 


AAA 


ATT 

TAA 


ATT 

TAA 


GTC 

PAPi 


GAA 

l» X X 


GAC 

PTPi 
i# x o 


35 


giy 


arg 


aia 


aia 


aia 


vai 


arg 


tAr 


vai 


pro 


ser 


Jgs 




GGA 
CCT 


AGA 
TCT 


GCT 
CGA 


GCT 
CGA 


GCT 
CGA 


GTT 
CAA 


AGA 
TCT 


ACC 
TGG 


GTT 
CAA 


CCA 
GGT 


AGC 
TCG 


AAG 
TTC 




pro 


leu 


asn 


pro 


lys 


iys 


pro 


ser 


his 


Jgs 


iie 


tgr 


40 


CCT 
GCA 


TTG 
AAC 


AAC 
TTG 


CCA 
GGT 


AAG 
TTC 


AAG 
TTC 


CCA 
GGT 


AGT 
TCA 


CAC 
GTG 


AAG 
TTC 


ATC 
TAG 


TAC 
ATG 




arg 


aia 


arg 


iys 


gin 


iie 


phe 


ieu 


ser 


cgs 


gig 


thr 


45 


CGT 
GCA 


GCT 
CGA 


AGA 
TCT 


AAG 
TTC 


CAA 
GTT 


ATC 
TAG 


TTT 
AAA 


TTG 
AAC 


TCT 
AGA 


TGT 
ACA 


GGT 
CCA 


ACC 
TGG 


iJe 


ser 


ser 


pro 


ieu 


vai 


ieu 


gin 


arg 


ser 


gig 


phe 




ATC 
TAG 


TCC 
AGG 


TCT 
AGA 


CCA 
GGT 


TTG 
AAC 


GTT 
CAA 


TTG 
AAC 


CAA 
GTT 


AGA 
TCT 


TCC 
AGG 


GGT 
CCA 


TTT 
AAA 



50 
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gly 


asp 


pro 


iie 


lys 


ieu 


arg 


ala 


aia 


griu 


vai 


ius 




GGT 
CCA 


GAC 
CTG 


CCA 
GGT 


ATC 
TAG 


AAG 
TTC 


TTG 
AAC 


AGA 
TCT 


GCC 
CGG 


GCT 
CGA 


GGT 
CCA 


GTT 
CAA 


AAG 
TTC 


5 


pro 


ieu 


val 


asii 


ieu 


pro 


giy 


vai 


o\Zy 


argr 


asn 


phe 




CCT 
GGA 


TTG 
AAC 


GTC 
CAG 


AAC 
TTG 


TTG 
AAC 


CCA 
GGT 


GGT 
CCA 


GTC 
CAG 


GGA 
CCT 


AGA 
TCT 


AAC 
TTG 


TTC 
AAG 




gin 


asp 


his 


tyr 


Ct/S 


phe 


phe 


ser 


pro 


tyr 


arg 


iie 


10 


CAA 
GTT 


GAC 
CTG 


CAT 
GTA 


TAT 
ATA 


TGT 
ACA 


TTC 
AAG 


TTC 
AAG 


AGT 
TCA 


CCT 
GGA 


TAC 
ATG 


AGA 
TCT 


ATC 
TAG 




lys 


pro 


gin 


tyr 


glu 


ser 


phe 


asp 


asp 


phe 


vai 


arg 


15 


AAG 
TTC 


CCT 
GCA 


CAG 
GTC 


TAC 
ATG 


GAG 
CTC 


TCT 
AGA 


TTC 
AAG 


GAT 
CTA 


GAC 
CTG 


TTC 
AAG 


GTC 
CAG 


CGT 
GCA 


gly 


asp 


ala 


giu 


iie 


gin 


lys 


argr 


vai 


vai 


asp 


gin 




GGT 
CCA 


GAT 
CTA 


GCT 
CGA 


GAG 
CTC 


ATT 
TAA 


CAA 
GTT 


AAG 
TTC 


AGA 
TCT 


GTC 
CAG 


GTT 
CAA 


GAC 
CTG 


CAA 
GTT 


20 


trp 


tur 


ala 


asn 


griu 


tAr 




pro 


ieu 


aia 


tAr 


asn 




TGG 
ACC 


TAC 
ATG 


GCC 
CGG 


AAT 
TTA 


GGT 
CCA 


ACT 
TGA 


GGT 
CCA 


CCT 
GGA 


CTT 
GAA 


GCC 
CGG 


ACT 
TGA 


AAC 
TTG 




gly 


iie 


glu 


aia 




val 


lys 


iie 


argr 


pro 


thr 


pro 


25 


GGT 
CCA 


ATC 
TAG 


GAA 
CTT 


GCT 
CGA 


GGT 
CCA 


GTC 
CAG 


AAG 
TTC 


ATC 
TAG 


AGA 
TCT 


CCA 
GGT 


ACA 
TGT 


CCA 
GGT 




glu 


giu 


ieu 


ser 


gin 


met 


asp 


glu 


ser 


phe 


gin 


griu 




GAA 
CTT 


GAA 
CTT 


CTC 
GAG 


TCT 
AGA 


CAA 
GTT 


ATG 
TAC 


GAC 
CTG 


GAA 
CTT 


TCC 
AGG 


TTC 
AAG 


CAG 
GTC 


GAG 
CTC 


30 


gly 


tyr 


arg 


glu 


tur 


phe 


glu 


asp 


lys 


pro 


asp 


lys 




GGT 
CCA 


TAC 
ATG 


AGA 
TCT 


GAA 
CTT 


TAC 
ATG 


TTC 
AAG 


GAA 
CTT 


GAC 
CTG 


AAG 
TTC 


CCA 
GGT 


GAC 
CTG 


AAG 
TTC 




pro 


val 


met 


Ais 


tyr 


ser 


iie 


iie 


aia 


giy 


phe 


phe 


35 


CCA 
GGT 


GTT 
CAA 


ATG 
TAC 


CAC 
GTG 


TAC 
ATG 


TCC 
AGG 


ATC 
TAG 


ATT 
TAA 


GCT 
CGA 


GGT 
CCA 


TTC 
AAG 


TTC 
AAG 




gly 


asp 


his 


thr 


lys 


iie 


pro 


pro 


gly 


lys 


tyr 


met 


40 


GGT 
CCA 


GAC 
CTG 


CAC 
GTG 


ACC 
TGG 


AAG 
TTC 


ATT 
TAA 


CCT 
GGA 


CCT 
GGA 


GGA 
CCT 


AAG 
TTC 


TAC 
ATG 


ATG 
TAC 




thr 


met 


phe 


his 


phe 


ieu 


giu 


tyr 


pro 


phe 


ser 


arg 




ACT 
TGA 


ATG 
TAC 


TTC 
AAG 


CAC 
GTG 


TTC 
AAG 


TTG 
AAC 


GAA 
CTT 


TAC 
ATG 


CCA 
GGT 


TTC 
AAG 


TCC 
AGG 


AGA 
TCT 


45 


gly 


ser 


iie 


his 


iie 


thr 


ser 


pro 


asp 


pro 


tyr 


aia 




GGT 
CCA 


TCC 
AGG 


ATT 
TAA 


CAC 
GTG 


ATT 
TAA 


ACC 
TGG 


TCC 
AGG 


CCA 
GGT 


GAC 
CTG 


CCA 
GGT 


TAC 
ATG 


GCA 
CGT 
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ala 


pro 


asp 


phe 


asp 


arg 


gly 


phe 


met 


asn 


asp 


glu 




GCT 
CGA 


CCA 
GGT 


GAC 
CTG 


TTC 
AAG 


GAC 
CTG 


CGA 
GCT 


GGT 
CCA 


TTC 
AAG 


ATG 
TAC 


AAC 
TTG 


GAT 
CTA 


GAA 
CTT 


5 


arg 


asp 


met 


aia 


pro 


met 


vai 


trp 


aia 


tyr 


lys 


ser 




AGA 
TCT 


GAC 
CTG 


ATG 
TAC 


GCT 
CGA 


CCT 
GGA 


ATG 
TAC 


GTT 
CAA 


TGG 
ACC 


GCT 
CGA 


TAC 
ATG 


AAG 
TTC 


TCT 
TTC 




ser 


arg 


glu 


tAr 


ala 


arg 


arg 


ser 


asp 


his 


phe 


aia 


10 


TCT 
AGA 


AGA 
TCT 


GAA 
CTT 


ACC 
TGG 


GCT 
CGA 


AGA 
TCT 


AGA 
TCT 


AGT 
TCA 


GAC 
CTG 


CAC 
GTG 


TTT 
AAA 


GCC 
CGG 




gly 


giu 


val 


thr 


ser 


his 


his 


pro 


ieu 


phe 


pro 


tyr 


15 


GGT 
CCA 


GAG 
CTC 


GTC 
CAG 


ACT 
TGA 


TCT 
AGA 


CAC 
GTG 


CAC 
GTG 


CCT 
GGA 


CTG 
GAC 


TTC 
AAG 


CCA 
GGT 


TAC 
ATG 




ser 


ser 


glu 


ala 


arg 


ala 


ieu 


giu 


met 


asp 


ieu 


giu 




TCA 
AGT 


TCC 
AGG 


GAG 
CTC 


GCC 
CGG 


AGA 
TCT 


GCC 
CGG 


TTG 
AAC 


GAA 
CTT 


ATG 
TAC 


GAT 
CTA 


TTG 
AAC 


GAG 
CTC 


20 


thr 


ser 


asn 


ala 


tyr 


gly 


gly 


pro 


ieu 


a Si} 


ieu 


ser 




ACC 
TGG 


TCT 
AGA 


AAT 
TTA 


GCC 
CGG 


TAC 
ATG 


GGT 
CCA 


GGA 
CCT 


CCT 
GGA 


TTG 
AAC 


AAC 
TTG 


TTG 
AAC 


TCT 
AGA 




ala 


giu 


leu 


ala 


his 


g2g 


ser 


trp 


thr 


gi-n 


pro 


leu 


25 


GCT 
CGA 


GGT 
CCA 


CTT 
GAA 


GCT 
CGA 


CAC 
GTG 


GGT 
CCA 


TCT 
AGA 


TGG 
ACC 


ACT 
TGA 


CAA 
GTT 


CCT 
GGA 


TTG 
AAC 




lys 


lys 


pro 


thr 


aia 


lys 


asn 


glu 


gly 


his 


vai 


thr 


30 


AAG 
TTC 


AAG 
TTC 


CCA 
GGT 


ACT 
TGA 


GCA 
CGT 


AAG 
TTC 


AAC 
TTG 


GAA 
CTT 


. GGC 
CCG 


CAC 
GTG 


GIT 
CAA 


ACT 
TGA 


ser 


asn 


gin 


val 


glu 


leu 


his 


pro 


asp 


iie 


giu 


tyr 




TCG 
AGC 


AAC 
TTG 


CAG 
GTC 


GTC 
CAG 


GAG 
CTC 


CTT 
GAA 


CAT 
GTA 


CCA 
GGT 


GAC 
CTG 


ATC 
TAG 


GAG 
CTC 


TAC 
ATG 


35 


asp 


glu 


glu 


asp 


asp 


lys 


aia 


He 


glu 


asn 


tyr 


iie 


GAT 
CTA 


GAG 
CTC 


GAG 
CTC 


GAT 
CTA 


GAC 
CTG 


AAG 
TTC 


GCC 
CGG 


ATT 
TAA 


GAG 
CTC 


ACC 
TTG 


TAC 
ATG 


ATT 
TAA 




arg 


glu 


Ais 


thr 


glu 


tAr 


thr 


trp 


his 


cgs 


ieu 


gly 


40 


CGT 
GCA 


GAG 
CTC 


CAC 
GTG 


ACT 
TGA 


GAG 
CTC 


ACC 
TGG 


ACA 
TGT 


TGG 
ACC 


CAC 
GTG 


TGT 
ACA 


CTG 
CCA 


GGA 
GGT 




thr 


Ct/S 


ser 


He 


gig 


pro 


arg 


glu 


gly 


ser 


lys 


iie 




ACC 
TGG 


TGT 
ACA 


TCC 
AGG 


ATC 
TAG 


GGT 
CCA 


CCA 
GGT 


AGA 
TCT 


GAA 
CTT 


GGT 
CCA 


TCC 
AGG 


AAG 
TTC 


ATC 
TAG 


45 


val 


lys 


trp 


gig 


gig 


vai 


leu 


asp 


his 


arg 


ser 


asn 




GTC 
CAG 


AAA 
TTT 


TGG 
ACC 


GGT 
CCA 


GGT 
CCA 


GTT 
CAA 


TIG 
AAC 


GAC 
CTG 


CAC 
GTG 


AGA 
TCT 


TCC 
AGG 


AAC 
TTG 
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vai 


tyr 


gly 


vai 




gly 


leu 


lys 


vai 


gly 




ieu 


GTT 


TAC 


GGA 


GTC 


AAG 


GGC 


TT CI 


Air* 






GAC 


TTG 


CAA 


ATG 


CCT 


CAG 


TTC 


CCG 


AAC 


TTC 


CAA 


CCA 


CTG 


AAC 


ser 


vai 


cys 


pro 


asp 


asn 


vai 


gly 


cys 


asn 


tAr 


t XI T 


TCC 


GTG 


TGC 


CCA 


GAC 


AAT 


GTT 




TGT 


AAC 


ACC 


TAC 


AGG 


CAC 


ACG 


GGT 


CTG 


TTA 


CAA 


CCA 


ACA 


TTG 




ATn 


thr 


tAr 


ala 


ieu 


leu 


He 


gly 

37 * 27 


olu 


lys 


t/*r 


ala 




ACC 


ACC 


GCT 


PTT 




r\ J. L. 




GAA 


AAG 


ACT 


GCC 


ACT 


TGG 


TGG 


CGA 


GAA 


AAC 


TAG 


CCA 


CTT 






CGG 


TGA 


leu 


vai 


gly 


giu 


asp 


leu 




tyr 


ser 




giu 


ala 


TTG 


GTT 


GGA 


GAA 


CAT 


TTA 


GGA 


TAC 


TCT 


GGT 


GAG 


GCC 


AAC 


CAA 


CCT 


CTT 


CTA 


AAT 


CCT 


ATG 


AGA 


CCA 


CTC 


CGG 


leu 


asp 


met 


tAr 


vai 


pro 


gin 


phe 


lys 


leu 


gly 


thr 


TTA 


GAC 


ATG 


ACT 


GTT 


CCT 


CAG 


TTC 


AAG 


TTG 


GGC 


ACT 


AAT 


CTG 


TAC 


TGA 


CAA 


GGA 


GTC 


AAG 


TTC 


AAC 


CCG 


TGA 


tyr 


giu 


lys 


thr 


gly 


leu 


a Ja 


arg 


phe 


stop 






TAC 


GAG 


AAG 


ACC 


GGT 


CTT 


GCT 


AGA 


TTC 


TAA- 


3' 




ATG 


CTC 


TTC 


TGG 


CCA 


GAA 


CGA 


TCT 


AAG 


ATT- 


5* 





Sequence B' 

A comparison of the above nucleotide sequence with the published (Ledeboer et al.) nucleotide 
sequence for the previously described alcohol oxidase from Hansenuta polymorphs reveals numerous 
significant differences, including the predicted amino acid sequence, the actual size of the gene (and the 
resulting protein), codon usage bias, and the like. 

Identification of p76 as Pi hydroxy acetone Synthase 

The nucleotide sequence for the first 51 nucleotides of the p76 gene was determined by standard 
techniques. From this sequence, the amino acid sequence for the amino terminal end of the p76 protein can 
be predicted: 



Amino acid sequence 




met 


ala 


arg 


iie 


pro 


lys 




Nucleotide sequence 


: 5' 


-ATG 


GCT 


AGA 


ATT 


CCA 


AAA 






3 1 


-TAC 


CGA 


TCT 


TAA 


GGT 


TTT 




pro vai ser thr 


gin 


asp 


asp 


iie 


his 


gly 


ieu 




CCA GTA TCG ACA 


CAA 


GAT 


GAC 


ATT 


CAT 


GAA 


TTG- 


3' 


GGT CAT AGC TGT 


GTT 


CTA 


CTG 


TAA 


GTA 


CTT 


AAC- 


5» 



This predicted amino acid sequence for p76 can be compared with the published amino acid sequence 
for the dihydroxyacetone synthase (DHAS) protein from Hansenuta polymorphs (Manowicz et al.). Although 
several differences in the sequences are apparent, there are similarities between the two proteins which can 
be discerned: 
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Pichia DHAS : met- a la- arg-ile-pro-lys-pro~ 
Hansenvla DHAS: met-ser-met-arg-ile~pro-lgs-ala- 

val-ser-thr-gln-asp-asp-ile-his-g2u- -leu- 
5 a ia- ser-val-asn-asp-glu-gln-his-gln-arg- He- 

Based on the significant degree of homology and the similar protein size (about 76,000 daltons) of 
Pichia p76 and Hansenula DHAS, p76 has been tentatively identified as DHAS from Pichia. 
io As above with the alcohol oxidase gene, a comparison of the nucleotide sequence for the first 51 
nucleotides of the Pichia DHAS protein with the previously published (Janowicz et al.) nucleotide sequence 
of Hansenula DHAS suggests numerous differences in codon usage bias, the" predicted amino acid 
sequence, the total size of the gene, etc. 

75 DNA Fragments Containing Regulatable Promoters from Pichia 
pastoris 

The 5' regulatory regions of the invention are detailed in restriction maps presented in Figures 4, 5 and 
20 6. The 5* regulatory region which controls the expression of polypeptide p76 is located within the DNA 
fragment depicted in Figure 4a. The approximately 2.9 kilobase pair Mndlll-X/>ol fragment has been clearly 
demonstrated to contain the regulatory function as detailed more fully below. Since cDNA clone pPC 1 5.0 is 
not a full copy cDNA, it is most likely that at least a portion of the DNA fragment depicted in Figure 4a 
includes structural coding sequences for polypeptide p76. Thus, the regulatory function is believed to reside 
25 in the approximately 1300 base pair Hin6U\-EcoH\ fragment shown in Figure 4b. Novel 0-galactosidase gene 
containing constructs, to be discussed in greater detail below, support this suggestion. 

The 5' regulatory region which controls the expression of polypeptide p72 (alcohol oxidase) is located 
within the approximately 2000 base pair EcoRI-EcoRV DNA fragment illustrated in Figure 5. Novel 0- 
galactosidase gene containing constructs discussed below demonstrate the regulatable nature of this DNA 
30 fragment. 

Figure 6 provides a restriction map for the approximately 3 kilobase pair BamH\-Sal\ DNA fragment 
which includes the 5' regulatory region which controls the production of polypeptide p40. This fragment is 
clearly distinguishable from the 5* regulatory regions detailed in Figures 4 and 5 based, inter alia, on the 
different restriction sites located within the DNA fragment. 

35 Figures 10, 2a and 11 provide restriction enzyme data for the regulatory regions plus structural genes 
for polypeptides p76. p72(alcohol oxidase) and p40. respectively. Hence, Figure 10 provides detail for the 
3.8 kilobase pair Hind\\\-P$t\ fragment of Pichia pastoris genomic DNA which controls and codes for the 
production of polypeptide p76. Figure 2a deals with the 4.0 kilobase pair EcoR\-Pvu\\ fragment of Pichia 
pastoris genomic DNA which controls and codes for the production of polypeptide p72(alcohol oxidase). 

40 Figure 11 presents the 3.7 kilobase pair BamH\-EcoHV fragment of Pichia pastoris genomic DNA which 
controls and codes for the production of polypeptide p40. 

The genomic clones, pPG 6.0, pPG 4.0 and pPG 4.8 have also been characterized by restriction 
mapping. Thus, clone pPG 6.0 is detailed in Figure 1a. As a point of reference the 5* end of the DNA 
fragment is deemed the origin. Clone pPG 6.0 is a tf/ndlll fragment of Pichia pastoris chromosomal DNA 

45 which is about 6 kilobase pairs in length, and is cleaved as follows by various restriction enzymes: 



Restriction Enzyme 


Cleavage 


Distance From Origin (bp) 




Sites 




Hindi 


5 


1070, 1740, 1890, 






3320, 5520 


EcoR\ 


2 


1300, 3450 


Xho\ 


1 


2860. 


Pst\ 


2 


3820, 4200 


Pvu\\ 


1 


4120 


PviA 


1 


4950 
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Clone pPG 4.0 is illustrated in detail in Figur 2a. The clone is an EcoRI-W/ndlll fragment of 
chromosomal DNA which is about 4 kilobase pairs in length. Referring to the 5* end of the clone as the 
origin, the following restriction data was obtained for pPG 4.0: 



5 


Restriction Enzyme 


Cleavage 
Sites 


Distance From Origin (bp) 




H/ndlll 


3 


400, 600. 1840 




Psfl 


1 


850 


70 


BamH\ 


2 


1960, 1970 




San 


1 


2620 




BglU 


2 


1040, 2700 




Kpn\ 


2 


500, 2730 




Xba\ 


1 


3330 


75 


Stu\ 


1 


3880 




Nde\ 


1 


420 




HincW 


2 


870; 2430 




Sstt 


1 


1200 




Bc/\ 


2 


1710, 4080 


20 


AsuU 


2 


1900, 2300 




EcoRV 


1 


1930 




PviA\ 


1 


4120 



Clone pPG 4.8 is illustrated in detail in Figure 3a. The clone is a 4.8 kilobase pair BamH\-EcoH\ 
fragment of Pichia pas ton's chromosomal DNA with the following additional restriction sites: 



Restriction Enzyme 


Cleavage 
Sites 


Distance From Origin (bp) 


C/al 


1 


410 


Kpn\ 


3 


500. 3890, 4280 


PvtA 


1 


1120 


San 


1 


2900 


PvuU 


1 


4135 


EcoRV 


2 


3690, 3690 


Bgn\ 


1 


4500 


Xma\ 


1 


4800 



40 The genomic clones pPG 6.0. pPG 4.0 and pPG 4.8 were manipulated by insertion into unique 
restriction sites on the £. coli plasmid pBR322. Clone pPG 6.0, which is a H/ndlll fragment, was 
conveniently cloned into the H/ndlll site of pBR322. Clone pPG 4.0 was cloned into the EcoR\-Pvti\\ sites of 
pBR322 and clone pPG 4.8 was cloned into the EcoRI-BamHI sites of pBR322. (See Example VI). E. coli 
strains transformed with these plasmids have been deposited with the Northern Regional Research Center, 

45 Peoria, Illinois, to insure free access to the public upon issuance of a patent on this application. The 
deposited strains have been given the following accession numbers: 



Genomic Class 


Laboratory Designation 


Accession No. 


pPG 6.0 
pPG 4.0 
pPG 4.8 


LE392-pPG 6.0 
LE392-pPG 4.0 
LE392-pPG 4.8 


NRRL B-15867 
NRRL B-15868 
NRRL B-15869 



Figures 7. 8 and 9 provide restriction map data for the 3 f regulatory regions of the polypeptides p76, 
p72 (alcohol oxidas ) and p40, respectively. The 3* regulatory regions are useful in controlling the 
polyadenylation. termination of transcription and termination of translation of mess nger RNA which is 
coded for by preceding nucleotide sequences. Thus, the 3' regulatory region from the polypeptide p76 
gene, a 2.7 kilobase pair EcoRI-W/ndlll fragment illustrated in Figur 7, is useful in controlling the 
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poiyadenylation as welt as termination of transcription and termination of translation of the mRNA which 
codes for polypeptide p76, or any other mRNA derived from a gene inserted upstream of the 3* regulatory 
region. The 0.2 kilobase pair Stu\-Pvu\\ fragment from the p72 gene detailed in Figure 8a, the 0.3 kilobase 
pair Stu\-Hln6\\\ fragment from the p72 gene detailed in Figure 8b, the 3.2 kilobase pair Sal\-EcoR\ fragment 
s from the p72 gene and the 1.9 kilobase pair Sall-EcoRI fragment from the p40 gene detailed in Figure 9 
have similar utility, both with respect to the structural genes with which they are associated in the wild type 
Plchia pas tori $ and any foreign (i.e. heterologous) genes which may be inserted upstream of these 3' 
regulatory regions. 

Since the alcohol oxidase gene in pPG4.0 terminates within a few hundred base pairs of the AO gene 
io transcription termination site, the additional 3' sequence detailed in Figure 8c was obtained as follows. The 
first step was to digest Pichia chromosomal DNA with EcoRI and Sail and hybridize the digested DNA with 
a 2.0 kbp ^P-labelled BamHI-Hindlll fragment from theAO gene by~the Southern blot method. Among the 
Pichia Eco RI-Sall digestion fragments which hybridized with the AO gene probe was a 3.2 kbp fragment 
which encodes the 3' portion of the AO gene and sequences flanking the 3' terminus of the gene. 
75 The 3* AO gene fragment was then cloned by recovering EcoRI-Sall-cut Pichia DNA fragments of about 
3.2 kbp by gel elution and inserting the fragments into EcoRI "and Sail-digested pBR322. Finally, a 
recombinant plasmid, pPG3.2, which contains the 3' AO gene fragment was identified by colony hybridiza- 
tion using the labelled AO gene fragment as probe. An E. coli strain transformed with plasmid pPG3.2 has 
been deposited with the Northern Regional Research Center, Peoria, Illinois. The deposited strain has been 
20 assigned accession number NRRL B-15999. Figure 8c shows a restriction endonuclease cleavage site map 
of the Pichia DNA fragment from pPG3.2. The fragment contains about 1 .5 kbp encoding the 3' portion of 
the AO (from Sail to Hindlll) and about 1 .7 kbp of sequence 3' of the AO gene. 

Characterization of cDNA Clones 

25 

The cDNA clones for the regulatable genes from Pichia pastoris have also been characterized by 
restriction mapping. In Figure 12, the p76 cDNA, a 1.1 kilobase pair fragment is detailed. Referring to the 5' 
end of the DNA sequence as the origin, restriction enzyme Xho\ cleaves p76 cDNA about 500 base pairs 
from the origin, HincH cleaves about 950 base pairs from the origin and EcoRI cleaves p76 cDNA about 

30 1050-1100 base pairs from the origin. The cDNA clone shown in Figure 12, as well as the cDNA clones 
shown in Figures 13 and 14 are all shown with Psfl termini. These are artificially created restriction sites 
produced by G-C tailing of the initially obtained complementary DNA to facilitate cloning of the DNA 
fragments into pBR322. Based on Northern hybridization studies and the size of the polypeptide product, it 
is estimated that the cDNA clone pPC 15.0 is an incomplete copy of p76 mRNA, representing only about 

35 half of the total messenger RNA sequence. 

In Figure 13, a composite restriction map for p72 (alcohol oxidase) cDNA, constructed by overlap of 
clones pPC 6.3 and pPC 8.0, is presented. As above, the 5' end of the DNA sequence is referred to as the 
origin. Thus, treating alcohol oxidase cDNA with a variety of restriction enzymes gives the following size 
fragments: 

40 



Restriction Enzyme 


Cleavage 
Sites 


Distance From Origin (bp) 


Asm 


2 


20, 420 


EcoRV 


1 


50 


fla/nHI 


2 


80, 90 


HincW 


1 


550 


Sail 


1 


820 


BglU 


1 


820 


Kpn\ 


1 


850 


Xba\ 


1 


1450 


Rsa\ 


1 


1760 


Stat 


1 


2000 



Restriction enzyme mapping of the 3' nd of th alcohol oxidas gene in clon s pPC 8.0 and pPC 8.3 
revealed that cDNA clone pPC 8.3 is missing approximately 250 nucleotides of the alcohol oxidas mRNA 
sequence (Figure 2). The sequences present at the 3' end of the alcohol oxidase mRNA are present in 
cDNA clone pPC 8.0 which overlaps pPC 8.3 by approximately 500 nucleotides. 
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Figure 14 presents a r striction map for the cDNA of polypeptide p40, a 1.2 kilobas pair fragm nt. 
Referring to the 5* end of the cDNA clone as the origin, clone pPC 6.7 is cleaved by Sa/I (and HincW) about 
1 000 bases from the origin. 

Each of the cDNA fragments have been cloned into pBR322 t which is then transformed into E. coll. The 
5 transformed strains have been deposited with the Northern Regional Research Center in Peoria, Illinois. The 
deposited strains have been assigned the following accession numbers: 



cDNA clone 


Laboratory Description 


Accession No. 


pPC 15.0 
pPC 8.3 
pPC 8.0 
pPC 6.7 


LE392-pPC15.0 
LE392-pPC 8.3 
MM294-pPC 8.0 
LE392-pPC 6.7 


NRRL B-15870 
NRRL B-15871 
NRRL B-15873 
NRRL B-15872 



Each of the above-described cDNA clones are useful as probes for the identification and isolation of 
chromosomal DNA encoding the production of polypeptides unique to the growth of yeast on methanol as a 
carbon and energy source. Hence as already described, these clones were used to identify P. pas tort's 
chromosomal DNA fragments containing the regulatory regions and structural coding information for the 
unique polypeptides which are observed when P. pastoris is grown on methanol. In a similar fashion, these 
cDNA clones have utility as probes for the identification and isolation of analogous genes from other 
methanol assimilating yeasts such as, for example, Torulopsis molischiana, Hansenula capsulatum, H. 
nonfermantens and the like (See Example XVII). 



Detailed Analysis of the Alcohol Oxidase Gene 

The 5* regulatory region of clone pPG 4.0 was further characterized by determining the nucleotide 
sequence of the clone upstream (5*) of the point where the structural information for p72 (alcohol oxidase) is 
encoded. The first 250 nucleotides prior to the mRNA translation start site (ATG codon) are believed to be: 



30 



40 



S / "ATGCTTCCAA 
ACGTTCATGA 
GACAGGCAAT 
AACCTTTTTT 
TTGCGACTGG 
CTTTTAACGA 
ATTCGAAACG" 3 7 



GATTCTGGTG 
TCAAAATTTA 
ATATAAACAG 
TTTATC ATC A 
TTCCAATTGA 
CAACTTGAGA 



GGAATACTGC 
ACTGTTCTAA 
AAGGAAGCTG 
XTATTAGCTT 
CAAGCTTTTG 
AG AT C AAAAA 



TGATAGCCTA 
CCCCTACTTG 
CCCTGTCTTA 
ACTTTCATAA 
ATTTTAACGA 
ACAACTAATT 



Secjuence C 



The promoter function of clone pPG 4.0 is believed to be contained within this sequence of nucleotide 
45 bases. 

In order to more fully describe this novel DNA fragment, an additional 301 nucleotides further upstream 
of the sequence detailed in Sequence C above have been determined. Thus, the first 551 nucleotides prior 
to the mRNA translation start site are believed to be: 
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& ^ tk ^*^n*f**f* 


AAALuL Xu X 


TTGGAACCTA 


nlnl uAwnAA 


AbL I* X VitA X U X 


ft ww%f%f% ft k j+ ft m 


^ K It /»m ft ft M|m 

GAACTAAGTT 




AnnlLLlAAL 




IHM ft ft ft m ft m at » 

TCAAAAAGAA 






CGTTTGTCTT 


GTTTGGTATT 


uAlXliALljnA 


lliL X uAAAAA 


TAATCTCATT 


ft ft im'i/WWfti ft *m0m 

AATGCTTAGC 


GCAGTCTCTC 


TATCGCTTCT 


GAACCCGGTG 


GCACCTGTGC 


CGAAACGCAA 


ATGGGGAAAC 


AACCCGCTTT 


TTGGATGATT 


ATGCATTGTC 


CTCCACATTGT 


ATGCTTCCAA 


GATTCTGGTG 


GGAATACTGC 


TGATAGCCTA 


ACGTTCATGA 


TCAAAATTTA 


ACTGTTCTAA 


CCCCTACTTG 


GACAGGCAAT 


ATATAAACAG 


AAGGAAGCTG 


CCCTGTCTTA 


AACCTTTTTT 


TTTATCATCA 


TTATTAGCTT 


ACTTTCATAA 


TTGCGACTGG 


TTCCAATTGA 


CAAGCTTTTG 


ATTTTAACGA 


CTTTTAACGA 


CAACTTGAGA 


AGATCAAAAA 


ACAACTAATT 


ATTCGAAACG" 


3'. 



Sequence D 



The additional nucleotides contained in Sequence D (compared to Sequence C) are believed to impart, by 
an unknown mechanism, additional regulatory functions to the promoter region contained within Sequence 
C. It should be recognized that Sequence D represents only partial DNA sequencing for the 1 .1 kbp DNA 
fragment shown in Examples XIV and XV to be capable of controlling gene expression in yeast. It may be 
that additional control functions are encoded in the portion of the 1.1 kbp DNA fragment not detailed in 
Sequence D. 

In order to further describe this novel 1.1 kbp DNA fragment additional nucleotide sequencing was 
carried out to fully delineate the nucleotide sequence of the entire 1.1 kbp DNA fragment shown in 
Examples XIV and XV to be capable of controlling gene expression in yeast. The nucleotide sequence is set 
forth as Sequence D': 
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CGAAAGGTTG 
AGGTCCATTC 
GGATACACTA 
CTCTTCTCTA 
GGGCTTGATG 
GCTACTAACA 
CCCCCTGGCG 
AAGCTCCGCA 
TTTCTGAGTG 

GCCCAAACT 

ATGACAAAAG 

GTTCGTTGAA 

TTCCAAAAGT 

TTGACGAATG 

AGTCTCTCTA 

AAACGCAAAT 

GCATTGTCCT 

GAATACTGCT 

CTGTTCTAAC 

AGGAAGCTGC 

TATTAGCTTA 

AAGCTTTTGA 

GATCAAAAAA 



AATGAAACCT 
TCACACATAA 
GCAGCAGACG 
ACACCATTTT 
GAGCTCGCTC 
CCATGACTTT 
AGGTCATGTT 
TTACACCCGA 
TGGGGTCAAA 
GACAGTTTAA 
CGTGATCTCA 
ATCCTAACGG 
CGCCATACCG 
CTCAAAAATA 
TCGCTTCTGA 
GGGGAAACAA 
CCACATTGTA 
GATAGCCTAA 
CCCTACTTGG 
CCTGTCTTAA 
CTTTCATAAT 
TTTTAACGAC 
CAACTAATTA 



5 ' -AGATCTAA 
TTTTGCCATC 
GTGCCAAACG 
TTGCAAACGC 
GCATGAAAAC 
ATTCCAATTC 
ATTAGCCTGT 
TGTTTATTTC 
ACATCACTCC 
TAGTTTCATG 
ACGCTGTCTT 
TCCAAGATGA 
CCAGTTGGTC 
TTTGTCTTGT 
ATCTCATTAA 
ACCCGGTGGC 
CCCGCTTTTT 
TGCTTCCAAG 
CGTTCATGAT 
ACAGGCAATA 
ACCTTTTTTT 
TGCGACTGGT 
TTTTAACGAC 
TTCGAAACG" 3 



CATCCAAAGA 
CGACATCCAC 
CAACAGGAGG 
AGGACTCATC 
AGCCAGTTAT 
CTTCTATTAG 
CTATCCTGGC 
CGAATGCAAC 
AGATGAGGGC 
TTCCCAAATG 
GGAACCTAAT 
ACTAAGTTTG 
AAAAAGAAAC 
TTGGTATTGA 
TGCTTAGCGC 
ACCTGTGCCG 
GGATGATTAT 
ATTCTGGTGG 
CAAAATTTAA 
TATAAACAGA 
TTATCATCAT 
TCCAATTGAC 
AACTTGAGAA 



Sequence D' 

It is recognized by those of skill in the art that additional control functions, relative to Sequences C and D, 
may be encoded in that portion of sequence D* which is further upstream (i.e., in the 5* direction) of the 
nucleotide sequence detailed in sequences C and D. 

To determine where RNA transcription for the alcohol oxidase gene is initiated, the DNA sequences 
around the 5* end of this gene from the genomic clone pPG 4.0 and the cDNA clone pPC 8.3 were 
compared. cDNA clone pPC 8.3 contains about 100 nucleotides of an untranslated region 5* to the alcohol 
oxidase gene. Based upon this sequence, an oligonucleotide of 15 bases (S'-CTTCTCAAGTTGTCG-S'); 
complementary with respect to nucleotides -29 to -43. where the A of the translation start site (ATG codon) 
is designated as +1 and the G in the 5' direction is designated as -1, was synthesized (See Example IX) 
and used as a primer to extend along the alcohol oxidase mRNA to reach the 5* end. The sequence of 
cDNA obtained from this primer-extension experiment revealed three different transcriptional initiation points 
for Pfchla pastoris alcohol oxidase mRNA The major transcript begins 114 nucleotides from the tran- 
slational initiation codon. Two minor alternative transcripts begin 117 and 111 nucleotides upstream (5*) 
from the alcohol oxidase AUG codon. 

The 55 nucleotides prec ding the start of alcohol oxidase mRNA contain a putative Goldberg-Hogness 
box (TATAA box). The sequence TATAAA occurs at position -40 from the 5* nd of th predominant 
transcript for alcohol oxidase mRNA and therefore 165 nucleotides upstream from the initiation codon for 
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this protein. 

The 3* regulatory region of the alcohol oxidase gene was further characterized by determining the 
nucleotide sequence for about 120 nucleotides downstream of the point where the structural information for 
p72 (alcohol oxidase) is encoded. The sequence is set forth below as Sequence D n : 



5 1 -TCAAGAGGAT GTCAGAATGC CATTTGCCTG AGAGATGCAG 

GCTTCATTTT TGATACTTTT TTATTTGTAA CCTATATAGT 

ATAGGATTTT TTTTGTCAAA AAAAAAAAAA AAAAAAAAAA-3 

Sequence D" 



Detailed Analysis of the p76 Gene 

The 5* regulatory region of the clone pPG 6.0 was also further characterized by determining the 
nucleotide sequence of the clone upstream (5') of the point where the structural information for p76 is 
encoded. The first 622 nucleotides prior to the mRNA translation start site (ATG codon) are believed to be: 









51 -XT 


CACCCATACA 


A CT AT AAA CC 


TTAGCAATTG 


AAATAACCCC 


AATTCATTGT 


TCCGAGTTTA 


ATATACTTGC 


CCCTATAAGA 


AACCAAGGGA 


TTTCAGCTTC 


CTTACCCCAT 


GAACAGAATC 


TTCCATTTAC 


CCCCCACTGG 


AGAGATCCGC 


CCAAACGAAC 


AGATAATAGA 


AAAAAACAAT 


TCGGACAAAT 


AGAACACTTT 


CTCAGCCAAT 


TAAAGTCATT 


CCATGCACTC 


CCTTTAGCTG 


CCGTTCCATC 


C CTTTGTTG A 


GCAACACCAT 


CGTTAGCCAG 


TACGAAAGAG 


GAAACTTAAC 


CGATACCTTG 


GAGAAATCTA 


AGGCGCGAAT 


GAGTTTAGCC 


TAGATATCCT 


TAGTGAAGGG 


TGTCCGATAC 


TTCTCCACAT 


T C AGTC AT AG 


ATGGGCAGCT 


TGTATCATGA 


AGAGACGGAA 


ACGGGCATAA 


GGGTAACCGC 


CAAATTATAT 


AAAGACAACA 


TGCCCCAGTT 


TAAAGTTTTT 


CTTTCCTATT 


CTTGTATCCT 


GAGTGACCGT 


TGTGTTTAAT 


ATAAAAAGTT 


CGTTTTAACT 


TAAGACCAAA 


ACCAGTTACA 


A C AAATT AT A 


ACCCCTCTAA 


ACACTAAAGT 


TCACTCTTAT 


CAAACTATCA 


AACATC AAAA- 3 1 







Sequence D r 1 ' 



The promoter function of clone pPG 6.0 is believed to be contained within this sequence of nucleotide 
bases, although those of skill in the art recognize that additional regulatory properties may be imparted by 
sequences further upstream than the sequences presented as Sequence D"'. 

The 3' regulatory region of clone pPG 6.0 was further characterized by determining the nucleotide 
sequence for about 180 nucleotides downstr am (3') of the point wher the p76 structural information is 
encod d. The sequence is set forth below as S qu nee D" M : 
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70 



5 1 -GTCAGCAGTG TTTCCTGCCA AAGCCATCAA GAGGACGTAC 

ATGCTCTCAT TTTTTGGTTT TCTATGTCCG ACGGGGTTCG 

TAAACTGGCT TCCTCCTTTT CCTTTCCTGT TGCATTTTAT 

TTGGTCAAAC AAAACTAGGG TCTTTTCCTA AAACCTTATG 

TCAATGGACC TACCACATAG-3 ' 

Sequence D 1 f 1 1 



Expression in Transformed Yeast 

75 

The above-described plasmids of the present invention have utility in yeast strains which can be 
transformed. Regulation of gene expression in yeast by the novel DNA fragments of the present invention 
can be accomplished by subjecting the transformed organisms to carbon source starvation. Carbon source 
starvation after growth on a variety of both catabolite repressing and non-catabolite repressing carbon 
20 sources induces expression of the gene product maintained under the control of the regulatory regions of 
the invention. Another means to achieve expression of the desired gene product in appropriate species of 
transformed yeast is to grow transformed yeasts on methanol. Yet another means to induce expression of 
the desired gene product is to grow transformed yeast on media containing non-catabolite repressing 
carbon sources. 

25 The regulatory regions of this invention are useful for expression in all yeast strains, since the 
regulatory regions have been shown to be induced under a variety of conditions. Thus, yeasts capable of 
growth on methanol or on non-catabolite repressing carbon sources can be caused to produce foreign, i.e., 
heterologous, polypeptides directly; while yeasts capable of growth on catabolite repressing carbon sources 
can be caused to produce foreign polypeptides by subjecting yeast cells so grown to conditions of carbon 

30 source starvation. 

Transformed yeast strains which are preferred in the process of the present invention include members 

of the genera: 

Candida, 

Kloeckera, 
35 Saccharomyces, 

Schizosaccharomyces, 

Rhodotoruta, 

Hansenula, 

Torulopsis, 
40 Pichia, and 

Kluyveromyces. 

Yeasts from these genera are preferred because their safety of handling, growth conditions and the like 

have been established and are well known to those of skill in the art. 

Especially preferred yeast strains for use in one embodiment of the process of the present invention are 
45 those yeast strains which are capable of growth on methanol as carbon and energy source. Yeasts known to 

be capable of growth on methanol include members of the genera: 

Candida, 

Kloeckera, 

Saccharomyces, 
so Rhodotoruta, 

Hansenula, 

Torulopsis, and 

Pichia. 

Since the regulatory regions of the present invention are also induced by growth on non-catabolite 
55 repressing carbon sources as well as conditions of carbon source starvation, y ast strains which are 
capable of growth on such non-m thanolic substrates as: 
glucose, 
acetate, 
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glycerol, 
ethanol, 
lactose, 
galactose, 
s fructose, 
sucrose, 

and the like and mixtures of any two or more thereof are also useful in the practice of the invention. By 
growing the host organism on a suitable non-catabolite repressing non-methanolic carbon source such as, 
for example, glycerol and galactose, or by growing the host organism on a suitable catabolite repressing 

io carbon source such as, for example, ethanol, glucose and fructose, then subjecting the host organism to 
carbon source starvation conditions, expression of a gene product under the control of the regulatory 
regions of the invention can be achieved. 

In addition, since the regulatory regions of the invention are responsive to a variety of growth conditions, 
both in terms of induction and repression of expression, the regulated expression of a gene product under 

is the control of the regulatory regions of the invention can be achieved. Thus, for example, cells can be 
grown on a carbon source which induces only low levels of foreign gene expression, then switched to 
methanol which will strongly induce gene expression. Alternatively, regulated gene expression can be 
achieved by employing mixtures of inducing/repressing feeds such as, for example, methanol-glucose 
mixtures. As yet another alternative, high expression levels produced by growth on methanol can be 

20 reduced as desired by addition to the growth media of a repressing carbon source such as glucose or 
ethanol. Of course, those of skill in the art recognize that other variations of feed mixtures and order of feed 
introduction are possible, and afford a great deal of control over the level of gene expression obtained from 
the invention regulatory regions. 

An especially preferred host yeast strain is the mutant Plchla pastoris GS115, which is a mutant 

25 defective in the ability to produce histidine, and has thus been designated as having the mutant genotype 
hi$4. GS115 is derived from Pichia pastoris NRRL Y-11430 and has been deposited with the Northern 
Regional Research Center of the United States Department of Agriculture in Peoria. Illinois. Pichia pastoris 
GS115 has been assigned the accession number NRRL Y-15851, as of August 31, 1984. This particular 
host is useful because it is an auxotrophic mutant deficient in the histidine pathway. Transformation of this 

30 host with a vector containing, among other DNA sequences, the HIS4 gene function, allows ready selection 
for transformed host. 

Since the regulatory regions of the present invention have also been demonstrated to be useful for the 
regulated expression of heterologous gene products in yeast strains of the genus Saccharomyces, for 
which a large number of auxotrophic mutants are known, additional preferred host yeast strains include 

35 ATCC 24683 (a trp1 ade1. his2, leul, gall, ural mutant), ATCC 24684 (a trpl, ade1. his7, gall, ural 
mutant), ATCC 32810 (a trp5, arg4, his5, Iys1, ade 2, ga! 2 mutant), ATCC 34182 (arTadelT his7"lys,"Tira 
mutant), ATCC 34352 (an ura 2 mutant), ATCC 34353 (an ura2 mutant), ATCC 38523 (an irg1, thrl mutant)^ 
ATCC 38626 (a jeu2, his4 mutant), ATCC 38660 (a his4, Ieu2, thr4 mutant), ATCC 42243~~(an u7a3 mutant), 
ATCC 42336 (an adel, his4. thr4 mutant), ATCC 42403 (an arg4, Iys7 mutant), ATCC 42404 (anadel. his4, 

40 Ieu 2 mutant). ATCC 42564 (an ural, his 6 mutant), ATCC 42596 (a his4, Ieu2, Iys 1 mutant), ATCC42957" (a 
his4, Ieu2, thr4, trp5 mutant). ATCC 42950 (an ade mutant). ATCC 42951 (an ade, leu mutant), ATCC 44069 
(an ural mutant). ATCC 44070 (a Ieu2, his4 mutant). ATCC 44222 (a his4 mutant^ ATCC 44376 (a his4, 
ade 2 mutant). ATCC 44377 (a his4, teul mutant), and the like are readily accessible to those of skill irTthe 
art. 

45 It is recognized by those of skill in the art that useful host strains are not limited to auxotrophic mutants. 
Thus, transformation of prototrophic strains with positive selection markers, such as, for example, antibiotic 
resistance genes, also provides a useful means for the detection and isolation of transformed strains. 

Escherichia coli is also a suitable host for the plasmids of the invention. Those of skill in the art 
recognize that many strains of £. coii are suitable hosts. Several strains employed in the present work are 

so summarized below: 



Strain designation 


Accession Number 


MC1061 

LE392 

MM294 


None known 
ATCC #33572 
ATCC #33625 
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Pichia pastoris Transformation Proc dur 

The transformation of Pfchia pastoris has not been previously described. The experimental procedures 
for transformation of Pichia pastoris are presented in greater detail below (Example XII). In order to develop 
s a transformation system for P. pastoris, the auxotrophic mutant GS115 (NRRL Y-15851) was isolated and 
determined to be defective in the histidine pathway in that the strain has no detectable histidinol 
dehydrogenase activity. 

GS115 (NRRL Y-15851) can be transformed by enzymatic digestion of the cell walls to give 
spheroplasts; the spheroplasts are then mixed with the transforming DNA and Incubated in the presence of 
10 calcium Ions and polyethylene glycol, then regenerated in selective growth medium deficient in histidine. 
The transforming DNA includes the HIS4 gene in which the host strain is deficient, thus only transformed 
cells survive on the selective growth medium employed. 

Isolation of Pichia pastoris HIS4 Gene 

75 

The HIS4 gene was isolated from the strain P. pastoris NRRL Y-11430 by partial digestion of total 
chromosomal DNA with Sau3A followed by centrifugation through sucrose gradients. (See Example XIII). 
Fragments of 5 to 20 kbp were cloned into the BamHI cleavage site of the S. cerevislae-E. coli shuttle 
vector YEp13 (ATCC 37115; Figure 33) and transformed into E. coli. Approximately 50,000 colonies were 

20 combined and total plasmid DNA extracted. Spheroplasts of S. cerevisiae strain 5799-4D (NRRL Y-15859), 
a his4ABC mutant, were mixed with about 1 ug of the YEp13 Pichia DNA library by the procedure of 
Hinnen et al (1978) and allowed to regenerate in a medium deficient in histidine. The transformation 
resulted in about 1x10 s prototrophic yeast colonies from a population of 5x1 0 7 total regenerable 
spheroplasts. A parallel control sample incubated without DNA produced no colonies. Total yeast DNA was 

25 extracted from 20 of the His + colonies and transformed back into E. coil. Seventeen of the yeast DNA 
preparations produced ampicillin resistant colonies. These cloned fragments were further characterized by 
restriction enzyme sizing and mapping as well as by their ability to cross hybridize with a labelled 5. 
cerevisiae HIS4 fragment at low stringency (post hybridization washes in 2xSSC at 55 *C* by the method 
described in Example XIII, §G. The HIS4-containing plasmids each contained one or more fragments which 

30 hybridized to the S. cerevisiae HIS4 gene. One such HIS4-containing plasmid was recloned to give a HIS4- 
containing plasmid designated pYJ8 and is shown in Figure 25. Plasmid pYJ8 contains pBR325 sequences, 
including chloramphenicol and ampicillin resistance genes, as well as the Pfchia HIS4 gene. 

The ARG4 gene was isolated from P. pastoris NRRL Y-11430 employing an analogous protocol and the 
Arg~ S. cerevisiae strain S2072A (a argA Ieu2 trpA gafZ mutant; obtained from the Yeast Genetic Stock 

35 Center, Berkely, CA). 

Those of skill in the art recognize that other marker genes from Pichia can similarly be isolated 
employing appropriately deficient S. cerevisiae strains. 

Isolation of Pichia pastoris Autonomous Replication Sequences 

40 

Another useful component of the vectors of the present invention are P/c/)/a-derived autonomous 
replication sequences (PARS), which enhance both the transformation frequency of GS115 (NRRL Y-15851) 
and the maintenance of plasmid as a stable extrachromosomal element 

To search for Pichia ARSs, DNA from Pichia pastoris GS115 (NRRL Y-15851) was partially digested 

45 with Taq\ and 5 to 10 kbp fragments were isolated and cloned into the unique C/al site of pYJ8A C/a. (See 
Figure 26). Plasmid DNA was recovered from about 10,000 His+ Pichia colonies and used to transform E. 
coil Plasmids from about 10,000 ampicillin resistant colonies were isolated and then transformed back into 
GS1 1 5. Forty of the His + yeast colonies from this sublibrary transformation were separately streaked onto 
selective medium and grown in separate cultures in selective medium. Total yeast DNA was extracted from 

50 each of these 40 cultures and transformed into E. coll Two plasmids, pYA63 (PARS1) and pYA90 (PARS2) 
whose yeast DNA preparations produced the most ampicillin resistant E. coli colonies, were selected for 
further analysis. Both of these plasmids transformed Pichia pastoris GS115 (NRRL Y-15851) at a very high 
frequency and each contained an insert of foreign DNA. 

As a measure for the ability of the ARSs to maintain plasmids as autonomous elements in Pichia, 

55 cultures of yeast cells which had b en transformed with each plasmid were grown in s lectiv m dium and 
p riodicaliy sampled. The state of th plasmid s quences in the cells was determined by Southern 
hybridization of unrestricted yeast DNAs to radioactively labeled pBR325. Plasmids pYA63 and pYA90 wer 
maintained in Pichia for at least 10 generations in th setectiv medium (but had integrated by 50 
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generations). 

One of the putative PIchia autonomous replication sequences (PARS1) was cloned into several other 
Pichia vectors to examine its ability to maintain the transforming DNA as an autonomous element. Plasmids 
pYJ30 (Figure 27) and pBPfl (Figure 34) were stilt present as autonomous elements after 20 generations of 
5 growth on selective media (His - ) and were present in multi-copies per cell. Southern blot analysis of cells 
transformed with pYJ30 indicate about 10 copies per cell. 

To determine if plasmids pSAOHS (see Figure 18) and pT76H4 (See Figure 22b), which contain PARS1 
contributed by pYJ30 and pBPfl, respectively, display similar stability to the plasmids from which they were 
derived, cells containing these vectors were grown under selective conditions for about 50 generations 
to under selective conditions (His") in the presence of glucose. The cells were then shifted to non-selective 
conditions (His*) and the loss of prototrophy was monitored. The stability of these plasmids was com- 
parable to the stability of pYJ30, including the rapid loss of His protrotrophy upon shift to non-selective 
media. Thus, it is believed that experiments carried out with plasmids containing the autonomous replication 
sequence. PARS1 , provide results of gene expression from autonomous plasmid DNA. 

75 

Novel /3-Galactosidase Gene Containing Constructs 

In order to demonstrate the ability of the regulatory regions of the present invention to control the 
production of protein products, novel DNA constructs were prepared. Thus the E coll lacZ gene was 

20 placed in several plasmids under the control of the regulatory regions of the genes encoding polypeptide 
p72 (alcohol oxidase) or p76. The preparation of plasmids pSAOHl, pSAOHS, pSAOHIO, pTAFH.85, 
pT76H1, pT76H2, pT76H3 and pT76H4 is described in Example XIV. 

Although the introduction of the regulatory region -£-galactosidase gene fusions of the invention into 
host yeast cells is described herein employing plasmids as the vehicle for introduction, those of skill in the 

25 art recognize that it is not necessary for the regulatory region-structural gene construct to be introduced into 
the cell via a plasmid. Hence, any molecule capable of being maintained in yeast can be employed. 
Therefore, the regulatory region-structural gene constructs of the invention can be manipulated via vectors 
other than plasmids. Alternatively, the regulatory region-structural gene construct can be integrated into the 
chromosome of the host yeast cell. 

so Those of skill in the art also recognize that the scope of the present invention is not limited to the 
production of /9-galactostdase under the regulation of the regulatory regions disclosed herein. The variety of 
polypeptides which can be produced under the regulation of the regulatory regions of the invention is 
limited only by the imagination of the reader. Many procedures exist for the preparation of DNA sequences 
which code for desired polypeptides. For example, oligonucleotides of various lengths can be synthesized 

35 by known procedures. Several such oligonucleotides can be assembled, in consequence of the specific 
base pairing properties thereof, into longer, double-stranded molecules. The component oligonucleotides of 
this double-stranded molecule can be joined (ligated) by the enzyme DNA ligase. Alternatively, DNA 
molecules having the desired coding sequence can be synthesized by use of the enzyme reverse 
transcriptase, using messenger RNA related to the desired polypeptide as a template for the action of 

40 reverse transcriptase. Yet another possibility is the cloning of genomic DNA fragments and observing 
whether direct expression of the desired product occurs. 

The DNA sequence which codes for the desired polypeptide can be modified for preparation of the 
regulatory region-structural gene construct by a variety of procedures. For example, the ends of the DNA 
prepared as described above can be ligated with the enzyme DNA ligase to short double-stranded DNA 

45 molecules which contain the nucleotide sequence recognized by specific restriction endonucleases, so 
called linker molecules. Digestion of these molecules with a specific restriction endonuclease following the 
ligation will generate termini corresponding to the specified restriction endonuclease recognition site at the 
ends of the prepared DNA sequence. 

Three specific regulatory region-0-galactosidase gene constructs prepared in the course of this work 

so are described in terms of restriction mapping data presented in Figures 15 and 16. The restriction map 
presented in Figure 15a describes a construct comprising a 0.85 kilobase pair H//?dlll-Sa/7?HI portion derived 
from the 5 V regulatory region of pPG 6.0 and the lacZ gene from E. coll (the 3.6 kilobase pair BamH\-Nru\ 
fragment shown). This same construct is present in each of the plasmids pTAFH.85, pT76H1 and pT76H2, 
to be described in greater detail below. (See Example XIV). The restriction map presented in Figure 15b 

55 describes a construct comprising a 1.3 kilobase pair H/ndlll-EcoRI portion derived from the 5* regulatory 
region of pPG 6.0 and the lacZ gene from E. coll. This sam construct is present in each of the plasmids 
pT76U1 , pT76H3 and pT76H4, to be described in greater detail below (see Example XIV). 

Figure 16 is a restriction map of a construct comprising a 1.1 kilobase pair 5coBI-0a/nHI fragment 
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derived from a portion of the 5 f regulatory r gion of pPG 4.0 and the /acZ gene from E. coll. This construct 
is present in each of the plasmids pSAOHl, PSAOH5 and pSAOHIO, to be described in greater detail 
below. (See Example XIV). 

Plasmid pSAOHl is illustrated schematically in Figure 17. In addition to containing the regulatory 
5 region-0-galactosidase gene fusion detailed in Figure 16, the plasmid is shown to contain: 

(a) pBR322 sequences, including the AmpP gene; 

(b) Pichia pastoris HIS4 gene; 

(c) S. cerevisiae 2u circle DNA; and 

(d) the interrupted URA3 gene from S. cerevisiae. 

io The plasmid therefore has the capability to transform and replicate in £ coif hosts and yeast hosts. 
Selectable markers are present for manipulation of the DNA in either E. colt or yeast hosts. 

Plasmid pSAOHS is illustrated schematically in Figure 18. The plasmid is similar to pSAOHl described 
above, except the S. cerevisiae 2u circle DNA and some of the Pichia pastoris HIS4 gene flanking DNA 
has been deleted while a Pichia pastoris autonomously replicating sequence (PARS1 from pYA63) has 

75 been added. 

Plasmid pSAOHl 0 is illustrated schematically in Figure 19. The plasmid contains: 

(a) regulatory region-0-galactosidase gene fusion; 

(b) pBR325 sequences, including genes conferring tetracycline resistance, chloramphenicol resistance 
and ampicillin resistance (fef*, cam* and amp*, respectively); and 

20 (c) S. cerevisiae HIS4 gene (obtained from plasmid pYA2 as described below). 

Plasmids pTAFH.85 f pT76H1 and pT76H2 are analogous to the above three described plasmids, except 
the regulatory region-0-galactosidase gene fusion employed was that described in Figure 15a (instead of 
the fusion described in Figure 1 6). 

Plasmids pT76H3 and pT76H4 are analogous to pSAOHl and pSAOHS, respectively, except the 
as regulatory region-jS-galactosidase gene fusion employed was that described in Figure 15b (instead of the 
fusion described in Figure 16). 

Plasmid pTAFH.85 is illustrated schematically in Figure 20 and comprises: 

(a) the regulatory region- j9-galactosidase gene fusion shown in Figure 15a; 

(b) p8R322 sequences, including the amp R gene; 
30 (c) Pichia pastoris HIS4 gene; 

(d) S. cerevisiae 2u circle DNA; and 

(e) the interrupted URA3 gene from S. cerevisiae. 

Plasmid pT76H1 is illustrated schematically in Rgure 21 and comprises: 
(a) the regulatory region-£-galactosidase gene fusion shown in Rgure 15a; 
35 (b) pBR322 sequences, including the ampP gene; and 

(c) Pichia pastoris HIS4 gene and autonomously replicating sequence (PARS1). 
Plasmid pT76H2 is illustrated schematically in Figure 22 and comprises: 

(a) the regulatory region-£-gaiactosidase gene fusion shown in Rgure 15a; 

(b) pBR325 sequences, including genes conferring tetracycline resistance, chloramphenicol resistance 
4o and ampicillin resistance; and 

(c) S. cerevisiae HIS4 gene. 

Plasmid pT76H3 is illustrated schematically in Rgure 22a and comprises: 

(a) the regulatory region-0-galactosidase gene fusion shown in Rgure 1 5b; 

(b) pBR322 sequences, including the amp R gene; 
45 (c) P. pastoris HIS4 gene; 

(d) S. cerevisiae 2u circle DNA; and 

(e) the interrupted URA3 gene from S. cerevisiae. 

Plasmid pT76H4 is illustrated schematically in Rgure 22b and comprises: 
(a) the regulatory region-0-galactosidase gene fusion shown in Rgure 15b; 
50 (b) pBR322 sequences, including the amp R gene; 

(c) Pichia pastoris HIS4 gene; and 

(d) Pichia pastoris autonomous replication sequence (PARS1). 
Expression of ff-Galactosidase in Yeast 

55 

Pichia pastoris GS115 (NRRL Y-15851) was transformed with th novel /9-galactosidase gene- 
containing constructs described abov . Several of the resulting transform d y ast strains have been 
deposited with th Northern Regional Research Center of th United States Department of Agriculture and 
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assigned deposit accession numbers as follows: 



Host 


Plasmid 


Accession Number of Transformed Strain 


GS115 


pSAOHt 


NRRL Y- 15852 


GS115 


pSAOHS 


NRRL Y-15853 


GS115 


pSAOHIO 


NRRL Y- 15854 


GS115 


pTAFH.85 


NRRL Y-1 5855 


GS115 


pT76H1 


NRRL Y-1 5856 


GS115 


pT76H2 


NRRL Y-1 5857 



The novel 0-galactosidase gene-containing constructs were also used to transform E. coli. Transformed 
bacterial strains have also been deposited with the Northern Regional Research Center in Peoria. Illinois. 
The transformed strains have been assigned the following accession numbers: 



Host 


Plasmid 


Accession Number of Transformed Strain 


MC1061 


pSAOHl 


NRRL B-15861 


MC1061 


pSAOHS 


NRRL B-15862 


MC1061 


pSAOHIO 


NRRL B-15863 


MC1061 


pTAFH.85 


NRRL B-15864 


MC1061 


pT76H1 


NRRL B-15865 


MC1061 


pT76H2 


NRRL B-15866 


MC1061 


pTA013 


NRRL B-15875 


MC1061 


pT76H3 


NRRL B-18000 


MC1061 


pT76H4 


NRRL B-18001 


MC1061 


PT76U1 


NRRI B-18002 



so Pichia pastoris GS115(NRRL Y-1 5851) transformed with each of the first eight plasmids described 
above which contain the alcohol oxidase and p76 regulatory region-/acZ gene fusions of the invention were 
grown to stationary phase on minimal medium supplemented with biotin plus glucose as carbon source. 
Once cells reached stationary phase, they were shifted to minimal medium supplemented with biotin plus 
methanol as carbon source. After cells had grown for about 3-5 generations at 30 " C, they were shifted to 

35 fresh minimal medium supplemented with biotin and grown on glucose or methanol as carbon source. At 
distinct time points, culture samples were withdrawn and analyzed for the presence of 0-gaIactosidase and 
alcohol oxidase by methods detailed in Examples VII and XV. 

It was found that cells grown on glucose as carbon source produced no detectable levels of 0- 
galactosidase or alcohol oxidase, while cells grown on methanol as sole carbon source expressed significant 

40 levels of both alcohol oxidase and £-gaIactosidase. It was also found that the glucose grown cells, when 
subjected to conditions of carbon source starvation, also expressed measurable quantities of alcohol 
oxidase as well as 0-galactosidase. Thus, it is clear that the regulatory regions of the invention are 
responsive to both the presence of methanol as well as conditions of carbon source starvation. 

As verification that the regulatory regions of the invention are responsive to growth on non-catabolite 

45 repressing carbon sources as well as conditions of carbon source starvation, a plasmid containing the 
alcohol oxidase regulatory region, pTA013; and a plasmid containing the p76 regulatory region, pT76U1. 
was used to transform a non-methanol utilizing strain of yeast, Saccharomyces cerevislae. One of the 
transformed strains employed, having the laboratory designation of SEY2102-pTAO13 has been deposited 
with the Northern Regional Research Center in Peoria, Illinois. The transformed strain has been assigned 

bo accession number NRRL Y-1 5858. Saccharomyces cerevislae NRRL Y-1 5858 and SEY2102-pT76U1 were 
grown up on glucose, fructose, ethanol, glycerol and galactose for about five generations then subjected to 
conditions of carbon source starvation. The usual assay for 0-gaJactosidase (See Example XV) after five 
generations indicated that glycerol and galactose grown cells produced large amounts of 0-galactosidase 
while glucose and fructose grown cells produced essentially no £-galactosidase. When l-gafactosidase was 

55 measur d after 6 hours under carbon sourc starvation, th production of moderate quantiti s of &• 
galactosidase by the transformed organisms grown on glucos and fructose as well as substantial quantiti s 
of ^-galactosidase produced by glycerol and galactose grown cells was observed. Thus, the r gulatory 
r gions of the invention are capable of controlling th production of protein products in g netically very 
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diverse yeast hosts and are not limited to utilization in methanol utilizing strains. 
EXAMPLES 

The buffers and solutions employed in the following examples have the compositions given below: 



1M Tris buffer 121.1 g Tris base in 800 aL of H 2 0; 

adjust pH to the desired value by 
adding concentrated (35%) aqueous HC1; 
allow solution to cool to room 
temperature before final pH adjustment, 
dilute to a final volume of 1L. 
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S-buffer 



l.S M sorbitol 

in 0.04 M sodium 

phosphate buffer at pH 6.6. 



PK buffer 



0.14 M NaCl 

1% sodium dodecyl sulfate (SDS) 
0.01 M EDTA 

in 0.05 M (pH 8.4) Tris buffer 



ETS buffer 



lOmM EDTA 
0.2 % SDS 

in 0.01 M (pH 7.4) Tris buffer 



TE buffer 



1.0 mM EDTA 

in 0.01 M (pH 7.4) Tris buffer 



ssc 



0.15 M NaCl 

15 mM sodium citrate 

adjusted to pH 7.0 with NaOH 



TAE 



PBS (Phosphate 
buffered saline) 



40 mM acetic acid 
5 mM EDTA 

in 0.02 M (pH 8.3) Tris buffer 

10 mM sodium phosphate (pH 7.0) 
0.15 M NaCl 



Laemmli Loading 
Buffer 



62.5 mM Tris-HCl (pH 6.8) 

2% SDS 

10% glycerol 

5% 2-mercaptoethanol 

0.01% bromphenol blue 
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RIPA Buffer 



IX NP40 (Sigma) 

1% sodium deoxycholate 

0.1% SDS 

in PBS 



20XSSPE 



20 mM EDTA 

0.16 M NaOH 

0.2 M NaH 2 P0 4 *H 2 0 

3.6 M NaCl 

adjusted pH to 7.0 with NaOH 



Denhardts • Solution 
(50x) 



5 g Ficoll 

5 g polyvinylpyrrolidone 

5 g Bovine serum albumin (BSA; Pentax 
Fraction V) 

brought to a total volume of 500 mL 
with water 



Prehybridization 
buffer 



LB (Luria-Bertani) 
Medium 



5x SSPE 

5x Denhardt's solution 
50% deionized formamide 
0.2% SDS 

200 pg/mL sheared and denatured herring 
sperm DNA 

5 g Bacto-tryptone 

5 g Bacto-yeast extract 

2.5 g NaCl 

in 1 L of water, adjusted to pH 7.5 
with NaOH 
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YPD Medium 1% Bacto-yeast extract 

2% Bacto-peptone 
2% Dextrose 

6.75 g yeast nitrogen base 

without amino acids (D1FCO) 
2% Dextrose 
in 1 L of water 

1 M Sorbitol 
25 mM EDTA 
50 mM DTT 

9.1 g Sorbitol 
1.47 g Sodium citrate 
0.168 g EDTA 
50 mL H 2 0 

--pH to 5.8 with HC1 

1 M Sorbitol 
10 mM CaCl 2 
— filter sterilize 

PEG Solution 20% polyethylene glycol-3350 

lOmM CaCl, 
35 _ 

lOmM Tris-HCl (pH 7.4) 
— filter sterilize 

40 SOS 1 M Sorbitol 

0.3x YPD medium 
10 mM CaCl 2 



SD Medium 

10 



SED 



T5 



SCE Buffer 

20 



25 



CaS 

30 



50 



55 
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Formamide dy mix 0.1% xylene cyanol FF 

0.2% bromphenol blue 
10 mM EDTA 

95% deionized form amide 



Top gel 76.8 gra urea 

24 mL acrylamide stock 
8 mL lOx TBE 

bring to final volume of 160 mL 



Acrylamide stock 38 gm acrylamide 

2 gm bis(N,N-methylenebisacrylamide) 
add water to total volume of 100 mL 



Bottom gel 14.4 gm urea 

3.0 gm sucrose 

7.5 mL 10X TBE 

4.5 mL acrylamide stock 

0.3 mL bromphenol blue solution 

(0.01 g/mL) 

add water to give total volume of 30 mL 



Prehybridization 
Buffer for 
hybridization 

selection 50% formamide 

0.75 M NaCl 
0.1 M Tris, pH 7.4 
0.008 M EDTA 
0.5% SDS 

200 pg/mL rabbit liver tRNAs (Sigma) 
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0.5 M NETS 
Buffer 



0.5 M NaCl 

10 mM EDTA 

10 mM Tris, pH 7.4 

0.2% SDS 



10X RT Buffer 



dil RT 



500 mM NaCl 

34fr mM Tris f pH 0.3 

60 mM MgClj 

50 mM DTT (dithiothreitol ) 

4 uL H 2 0 

1 uL 10X RT Buffer 

5 pL reverse transcriptase, 15 U/pL 

(Life Sciences, Inc. ) 



dideoxy : 

dd ATP 
dd OTP 



0.49 mM 
0.1165 mM 



dd CTP 


0.369 


mM 




dd TTP 


0.82 mM 




dNTP mix 


0.625 


mM 


dGTP 




0.625 


mM 


dATP 




0.625 


mM 


TTP 


Chase 


1.125 


mM 


dATP 




1.125 


mM 


dCTP 




1.125 


mM 


dGTP 




1.125 


mM 


TTP 




in IX 


RT 


buffer 



Unless otherwise specified, the above solutions represent the basic (1x) concentration employed. Through- 
out the examples* where the different concentration levels are employed, that fact is indicated by referring 
to the solution as a multiple of the basic (1x) concentration. 

The following abbreviations are used throughout the examples, with the following meaning: 



EDTA 


ethylenediamine tetraacetic acid 


TEMED 


N t N,N\N*-tetram ethylenediamine 


DTT 


dithiothreitol 


BSA 


bovine serum albumin 


EtBr 


ethidium bromide 


Ci 


Curie 


dATP 


deoxyadenosine triphosphate 


dGTP 


deoxyguanosine triphosphate 
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TTP thymidine triphosphate 

dCTP deoxycytidine triphosphate 

dXTP "generic" deoxy triphosphate nucleotide 

oligo(dT) 12 -i8 Source: Collaborative Research, Inc. 

5 Zymolyase 60.000 Source: Miles Laboratories 

Several procedures carried out on a routine basis follow a standard protocol which will be detailed here. 
Centrifugation is carried out for a period of time and at a spin rate sufficient to provide a clear 
supernatant. Generally, centrifugation of yeast cells is carried out at at least 1500 g for at least 5 minutes. 
Nucleic acid extractions with phenol/chloroform/isoamyl alcohol involve contacting the nucleic acid 
io containing solution with an equal volume of a 50:48:2 ratio by volume mixture of phenol, chloroform and 
isoamyl alcohol, respectively. Extractions with chloroform/isoamyl alcohol involve contacting the solution to 
be treated with an equal volume of 48:2 ratio by volume mixture of chloroform and isoamyl alcohol. 

When gels, Miters, etc. are described as being washed or soaked in a specified solution, the entire gel, 
filter, or the like was immersed in an appropriate vessel (pan, dish, vial, etc.) in order to contact the entire 
75 surface of the gel, filter, or the like with the solution of interest. 

Ethanol precipitation of nucleic acids involves first adjusting the salt content of the nucleic acid- 
containing solution, then contacting the solution with two volumes of cold ethanol. 

EXAMPLE I 

20 

Growth and Preparation of Yeast Cetls 

Pichla pastoris NRRL Y-11430 was grown under carbon limited conditions in continuous culture at 
30 • C with either methanol or ethanol as sole carbon source in I Ml salts minimal medium as described by 

25 Wegner in U.S. 4,414,329. IM1 minimal media contains, per liter of media, 36 mM KH 2 PO+. 23m M (NH + >- 
2SO4. 2mM MgSO*. 6.7 mM KCI. 0.7 mM CaCI 2 . 0.2 uM CuSCVS H 2 0, 1.25 uM~KI. 4.5 uM MnSOi. 2 uM 
NaaMoO*. 0.75 wM H3BO3, 17.5 uM ZnSO*. 44.5 uM FeCI 2 and 1.6 uM biotin. The cells grown on 
methanol were grown up to a celt density of 140 g/L (dry weight) with a retention time of about 12 hours. 
The cells grown on ethanol were grown up to a cell density of 90 g/L with a retention time of about 11.5 

30 hours. When methanol or ethanol were fed into the fermenter, feed stocks containing concentrations of 20% 
and 45% alcohol, respectively, were used. 

Ten grams of fermenter grown Pichla pastoris cells were collected by centrifugation and resuspended 
at approximately 10 s cells/mL in 0.1 M Tris (pH 8.0) containing 1% 2-mercaptoethanol. These cells were 
incubated for 5 to 10 minutes at 37 *C and collected by centrifugation. The pellet was washed once with 30 

35 mL of S-buffer and resuspended in 5 mL of S-buffer per gram of cells. Zymolyase (Miles Biochemicals) was 
added to the cell suspension to give a final concentration of 500 ug/mL. The cells were incubated at 37 • C 
for 20 minutes and then centrifuged; supernatant discarded and the cell pellet collected. This pellet was 
frozen in liquid nitrogen and stored at -70 * C for later use. 

40 EXAMPLE II 

Isolation of Yeast RNA 

Total cell RNA was prepared by pulverizing the frozen pellet prepared as described in Example I with a 
45 mortar and pestle and further disrupting the frozen pellet for about 2-5 minutes in a Waring Blender in the 
presence of liquid nitrogen. The pulverized pellet was added to PK buffer at a concentration of 7.5 mL per 
gram of cells. Proteinase K (Boehringer Mannheim) was added to the resuspended pellet to give a final 
concentration of 400 ug/mL. and the suspension was incubated at room temperature for 10 minutes. This 
mixture was extracted with phenol/chloroform/isoamyl alcohol followed by a chloroform/isoamyl alcohol 
50 extraction. Nucleic acids were precipitated by adjusting the solution to be 0.25 M NaCI and adding ethanol. 
The pellet was resuspended in a minimum volume of ETS buffer, i.e. that volume of buffer sufficient to 
dissolve the nucleic acids; generally, about 100 ug up to about 1 mg of DNA per mL of solution. This 
solution was re-extracted with phenol/chloroform/isoamyl alcohol, then chloroform/isoamyl alcohol and finally 
precipitated with ethanol. 

55 The nucleic acids were redissolved in a minimum volume of TE buffer. The RNA present in this solution 
was nriched either by centrifugation through a 4 mL CsCI cushion (1 g CsCl/mL, 1 mM EDTA, in 10 mM 
Tris (pH 7.4) buffer), or by precipitation by making the solution 2 M LiCI, maintaining "at 4-8 *C overnight 
and coll cted by c ntrifugation. The poly A+ RNA was selected from the solution by affinity chromatog- 
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raphy on oligo(dT)cellulos columns. Generally, 0.25 g of oligo (dT) cellulose, typ 3 (Collaborative 
Research) was prepared for chromatography per 5 to 1 0 mg of total RNA. 0.25 g of oligo (dT) cellulose was 
slurried in 2 mL of ETS buffer and poured into a small, siliconized glass column. This oligo (dT) cellulose 
column was washed by layering 10 mL of 0.1 M NaOH over the oligo (dT) cellulose and allowing the wash 

5 solution to flow through the oligo (dT) cellulose matrix. The oligo (dT) cellulose was then washed in the 
same manner with 10 mL of ETS buffer and washed a final time with 10 mL of 0.5 M NETS buffer. 

Total RNA (5 to 10 mg) was resuspended in ETS buffer at a concentration not greater than about 10 
mg/mL, placed in a 65 • C water bath for 2 minutes and then placed immediately on ice. The RNA solution 
was then allowed to warm to room temperature and a stock solution of 5 M NaCI was added to give a final 

to salt concentration in the RNA solution of 0.5 M NaCI. The resulting RNA solution was layered onto the 
prepared oligo (dT) cellulose column and allowed to slowly flow through the column at a rate of about 1 
drop/5 seconds. The material flowing out the column bottom was collected in a tube and relayered onto the 
top of the column. The material collected from the column bottom was relayered on top a second time, 
resulting in the RNA solution being passed through the oligo (dT) cellulose column a total of three times. 

is After the last pass through the column, the material was collected and labelled as the poly A-, i.e.. non-poly 
A RNA. The column was then washed with 30 mL of 0.5 M NETS and finally the poly A+ RNA was eluted 
from the column by loading 5 mL of ETS buffer onto the~column and allowing this buffer to flow through 
slowly, collecting the poly A+ RNA fraction in the 5 mL fraction flowing from the bottom of the column. 
Assuming that there was no NaCI in the poly A+ RNA fraction, the NaCI concentration of this fraction was 

20 adjusted to 0.25 M NaCI and RNA precipitated with ethanol. 

EXAMPLE 111 

Construction of cDNA Library 

25 

Complementary DNA (cDNA) clones were synthesized as follows. Ten ug of poly A + RNA prepared as 
described in Example II was resuspended in 7 uL H 2 0 and brought to a final concentration of 2.7 mM 
CHaHgOH, then incubated at room temperature for 5 minutes. The first strand of cDNA was synthesized at 
42* C for 15 minutes in 50 uL of a solution containing 50 mM Tris, (pH 8.3 at 42* C), 10 mM MgCb. 30 mM 

30 2-mercaptoethanol, 70 mM KCI, 500 uM each of dATP. dGTP, and TTP, 200 uM dCTP,~25 ug/mL oligo^ 
(dT), 60 ug/mL actinomycin D, 25 unite" RNasin (Biotec, Inc.), 25 uCi a-&P dCTP (32.5 pmoies), and 120 
units of reverse transcriptase (Life Sciences Inc.). This reaction mix was incubated at 37 • C for an additional 
15 minutes. The reaction was terminated by the addition of 2 uL of 0.5 M EDTA and 0.5 uL 20% SDS. The 
reaction was adjusted to 0.3 M NaOH and incubated at 65 * C for 30 minutes. The reaction mix was then 

35 neutralized by the addition of 10 uL of 1 M Tris. (pH 7.4) and adjusting the reaction mix to 0.21 M HCl. The 
reaction mix was phenol/chloroform/isoamyl alcohol extracted, then chloroform/isoamyi alcohol extracted 
and finally chromatographed over a Sephadex G50 column in TE buffer. The radioactive single-stranded 
cDNA was pooled into one fraction and concentrated to 100 uL either by butanol extraction or evaporation 
by centrifugation under vacuum. The single stranded cDNA was ethanol precipitated from the concentrated 

40 solution, cDNA collected by centrifugation and resuspended in 1 00 uL of water. 

The aqueous single-stranded cDNA solution was adjusted to 2.5 M ammonium acetate, ethanol 
precipitated, collected by centrifugation and resuspended in 20 uL of water. This single stranded DNA 
solution was brought to a final volume of 50 uL with 50 mM potassium phosphate buffer (pH 7.4) containing 
5 mM MgCfe, 1 mM 2-mercaptoethanol, 250 uM each of dATP. dGTP, and TTP. 125 uM dCTP, 25 uCi-a- 

45 ^P-dCTP (32.5 pmoies), and 8 units of Klenow fragment DNA Poll (New England Bioiabs). The resulting 
reaction mixture was incubated at 37* for one hour in order to synthesize the complementary second DNA 
strand to the single stranded cDNA. The reaction was terminated by the addition of 2uL of 0.5 M EDTA. 
The double stranded cDNA was phenol/chloroform/isoamyl alcohol extracted, chloroform/isoamyT alcohol 
extracted and chromatographed over a Sephadex G50 column in TE buffer. The double stranded cDNA 

so fractions were pooled and the pool was concentrated and precipitated as described for the single-stranded 
cDNA. 

After the final ethanol precipitation and the collection of the double stranded cDNA by centrifugation, the 
pellet was resuspended in 20.25 uL of water, then brought to a final volume of 50 uL with 50 mM Tris, (pH 
8.3 at 42* C), containing 10 mM MgCfe, 30 mM 2-mercaptoethanol. 70 mM KCI, 500 uM of dXTP. and 150 
65 units of reverse transcriptase. The resulting solution was incubated at 42 * C for 15 "minutes in rder to 
insure completion of the synthesis of the s cond strand of cDNA. Th reaction was terminated by the 
addition of 2 uL of 0.5 M EDTA and cone ntrated and precipitated as described for the single stranded 
cDNA reaction. 
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Th doubl stranded cDNA pellet was resuspended in 42 uL of H2O and the solution brought to a finaJ 
volume of 47 uL by the addition of 5 uL of a stock solution containing 2.8 M NaCI, 200 mM NaOAc and 45 
mM ZnSO*. then adjusted to a pH of 4.5 at 22* C with HCI. In order to digest the hairpin loop, three 
separate reactions were done with three different concentrations of S% nuclease (Sigma). One unit, 10 units 

5 or 100 units of Si nuclease were added to bring the reaction volume to 50 uU and the reaction incubated at 
22 * C for 30 minutes. The reaction was terminated by the addition of 2 uL of 0.5 M EDTA and 2.67 uL of 2 
M Tris base. Six ug of rabbit liver tRNA were added as a carrier, and the reaction~mixture was concentrated 
and precipitated as described above except the DNA pellets were resuspended in TE buffer rather than 
water. After the final precipitation, the pellet was resuspended in 20 uL of TE buffer and brought to a final 

70 volume of 50 uL in terminal transferase buffer (BRL) containing 10 pmoles of a-^P-dCTP, 2 u,M dCTP and 
21 units of terminal transferase (Ratliff Biochem). The resulting solution was incubated at 37*C for 30 
minutes in order to add poly d(C) tail to the 3*-OH end of the double-stranded cDNA. The reaction was 
terminated by the addition of 5 uL of 0.5 M EDTA, extracted, chromatographed, and stored as an ethanol 
precipitate. 

75 The double stranded, d(C) tailed cDNA was either reannealed directly to poly d(G) tailed pBR322 
opened at the P$t\ site or first size fractionated on a Sepharose CL4B-200 column <25 uL fractions). For the 
unfractionated library, 150 ng of double-stranded poly d(C) tailed cDNA were annealed in 180 uL of 10 mM 
Tris, (pH 7.4) which is 0.1 M in NaCI and 1 mM in EDTA to 900 ng of d(G) tailed pBR322 opened at the 
Psfl site. Each 25 uL fraction of the fractionated library was annealed to 125 ng of poly d(G) tailed pBR322 

20 in a 50 uL final volume of the same annealing mixture described above. The annealing reactions were 
incubated at 65* C for 3 minutes, then 42* C for 2 hours and allowed to cool slowly to room temperature. 

The annealed cDNA library was transformed into competent E. coll LE392 (ATCC 33572) prepared as 
follows: An inoculum of LE392 was grown overnight at 37 • C in 2x LB media. Five mL of this overnight 
culture was inoculated into 200 mL of fresh 2x LB media and grown to an ODgoo of 0.2-0.3 at 37 *C. This 

25 culture was placed on ice for 10 minutes and the cells were then collected by centrifugation at 4*C. The 
cell pellet was resuspended in 80 mL of ice cold 0.1 M CaCI 2 and incubated for 25 minutes at 4*C. The 
cells were collected by centrifugation at 4*C. the cell "pellet resuspended in 2m L of ice cold 0.1 M CaCI 2 
and incubated for at least 2 hours at 4*C prior to use. Then 200 uL of competent cells per 50 uL of 
annealing mix were used for the transformation. The competent cells and the DNA were combined and 

30 incubated at about 4*C for ten minutes, followed by an incubation at 37*0 for 5 minutes and finally placed 
on ice for 10 minutes. An equal volume of 2X LB media was added to the transformation mix and incubated 
at 37*0 for 45 minutes. The transformed cells were plated at 250 u Lip late on 150 mm 2x LB plates 
containing 15 ug/mL of tetracycline. The plates were incubated at 37* C for 24 hours and stored at 4* C. 
Replica filters were prepared by stamping nitrocellulose filters onto an original filter used to lift the 

35 colonies off of the plate. These replica filters were incubated on 2x LB-Tet (15 ug/mL of tetracycline) plates. 
The colonies on the filters were prepared for probing by transferring the filters to 2x LB-Tet plates 
containing 200 ug/mL of chloramphenicol, incubating the filters at 37*0 for at least 12 hours, then lysing 
the colonies by floating the filters on an aqueous pool which is 1.5 M NaCI and 0.5 M NaOH for 10 minutes. 
The filters were then neutralized by floating them on an aqueous pool which is 1 .5 M NaCI and 0.5 M Tris, 

40 (pH 7.4) for 15 minutes and repeating this neutralization again. The filters were then air dried andlinally 
dried under vacuum for 2 hours at 70 * C. 

EXAMPLE IV 

45 Colony Hybridization 

The vacuum dried nitrocellulose filters containing the cDNA library (prepared as described in the 
previous example) were prehybridized at 42 "C. for 5 hours in prehybridization buffer. The fitters were 
removed from the prehybridization buffer and lightly rubbed with a gloved hand in 5x SSPE in order to 

50 remove cell debris. The filters were placed in hybridization buffer (same as prehybridization buffer except 
1x Denhardfs). Either ^P-labelled single-strand cDNA (10 6 cpm/mL) or end-labeled poly A+ RNA was 
hybridized to the filters for 17 hours at 42* C. After hybridization, the filters were washed briefly in 2x SSPE 
at 22* C. followed by two washes at 65*0 in 0.1x SSPE. 10 minutes each. 

End-labeling of poly A+ mRNA was performed by adding 2 ug of poly A+ mRNA to a volume of 50 

55 uL containing 50m M Tris, (pH 9.5) heating to 100* C for three minutes, and rapidly chilling on ice. This RNA 
solution was dilut d~to a final volume of 200 uL and adjusted to 50 mM Tris, (pH 9.5) 10 mM MgCfe, 5mM 
DTT and 50 pmoles of ^P-a-ATP. Ten units of T4 polynucleotid kinase (Boehringer Mannheim) was added 
and the mixtur incubated at 37* C for one hour. The kinasing reaction was terminated by the addition of 10 
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UL of 0.5 M EDTA, extracted with ph nol/chloroform/isoamyl alcohol and chromatographed through 
Sephadex G50 to remove the unincorporated radioactive label. 

EXAMPLE V 

5 

Northern Hybridizations 

*■ 

Two to five ug of poly A+ mRNA were heated at 65*,C for 5 minutes in 10 mM sodium phosphate 
buffer (pH 7.4) containing 50% formamide, 2.2 M formaldehyde, and 0.5 mM EDTA. The resulting solution 

10 was cooled to room temperature and an appropriate amount (generally about 0.2 volumes based on the 
volume of sample treated) of 5x sample buffer (0.5% SDS, 0.025% bromophenol blue. 25% glycerol, 25 
mM EDTA) was added. The samples were loaded on a 1.5% agarose gel prepared in 10 mM sodium 
phosphate buffer (pH 7.4), containing 1.1 M formaldehyde, and electrophoresed in the same buffeTT The gel 
was stained with acridine orange (33 ug/mL) in 10 mM sodium phosphate buffer (pH 7.4), destained by 

is soaking the gel in the same buffer for 10 minutes, soaked in lOx SSPE for at least 10 minutes, and the RNA 
transferred to nitrocellulose as described in Example VI. 

EXAMPLE VI 

20 Isolation Of Genomic DNA And Clones 

P/chia genomic DNA was isolated using the method described in Example II for Pichfa RNA isolation. 
The nucleic acid pellet was resuspended in a minimum volume TE buffer, and incubated with 20 ug/mL 
RNase A for 30 minutes at 37' C. The solution was brought to 0.14 M NaCI and treated with proteinase K at 

26 200 ug/mL for 15 minutes at 22 'C. The resulting solution was first extracted with phenol/chloroform/isoamyl 
alcohol and then with chloroform/isoamyl alcohol and finally ethanol precipitated. The precipitated DNA was 
resuspended in a minimum volume of TE buffer, and centrifuged in order to clear the DNA solution. 

Ten ug of Plchla genomic DNA prepared as described in the previous paragraph was digested with 
various restriction enzymes (BRL) and electrophoresed on a 1% agarose gel containing TAE. The DNA 

30 fragments in the gel were denatured by soaking the gel in 1.5 M NaCI, 0.5 M NaOH for 20 minutes. The gel 
was neutralized by soaking in 1.5 M NaCI, 0.5 M Tris, (pH 7.4) for 20 minutes. Prior to transfer, the gel was 
soaked in 10x SSPE for at least 5 minutes. A sheet of nitrocellulose was cut to the size of the gel, wetted in 
water and soaked briefly in 10x SSPE. This filter was laid on top of the gel which in turn had been placed 
on a piece of parafilm. A sheet of Whatman filter paper and a stack of paper towels were placed on top of 

35 the nitrocellulose in order to draw the DNA out of the gel and transfer it to the nitrocellulose. A weight was 
placed on the stack to facilitate transfer. The DNA was allowed to transfer in this manner for at least 3 
hours. After the transfer, the filter was soaked in 5x SSPE briefly, air dried, and dried under vacuum at 
70 'C for 2 hours. Complementary genomic fragments were identified by hybridization to nick-translated 
cDNA clones pPC 8.0. pPC 6.4 and pPC 15.0 using the same prehybridization, hybridization, and washing 

40 buffers described in Example IV. 

200 ng of the cDNA clones were nick-translated for 80 minutes at 14* C in 30 uX of a solution 
containing 50 mM Tris-HCI (pH 7.4), 10 mM MgSCV 100 uM DTT, 50 ug/mL BSA, 20 uM each of dGTP, 
TTP and dATP, 31.25 pmoles 32 P-*-dCTP (3200 Ci/mmol, NEN), 2 units E. coll DNA PoF(BRL), and 0.02 
ng DNasel. The reaction was terminated by the addition of 1 uL of 0.5 M EDTA and 1 uX of 20% SDS. The 

45 labelled DNA solution was brought to a final concentration of 0.3 M NaOH and placed in boiling water for 3 
minutes. This mixture was chromatographed on a Sephadex G50 column. The labelled DNA fractions were 
pooled, the specific activity determined and the probe used in hydridization experiments. 

Genomic fragments which hybridized to the cDNA probes were isolated by digesting 200 ug of Pichia 
genomic DNA with various restriction enzymes (BRL) and electrophoresing the digest on a 1 % agarose gel 

so in TAE buffer. The appropriate sized band was sliced from the agarose gel, the DNA electrocuted, passed 
through an Elutip column (Schleicher and Schuell) and ethanol precipitated. 

The electrocuted fragments were resuspended in water and 200 ng fragments were ligated to 500 ng of 
pBR322 which was cleaved at the appropriate restriction site and dephosphorylated when necessary. The 
ligation reaction was carried out in 300 UL of 66 mM Tris, (pH 7.4) containing 6.6 mM MgCI 2 , 10 mM DTT, 

55 0.4 mM ATP, 25 ug/mL BSA, and 40-80 units of T4 DNA Hgase, then incubated at~4>C for 24 hours. The 
ligation mix was transformed directly into competent LE392 E. coll cells. The cells wer made competent 
and the transformation done as d scribed in Example III. A series of three transformations wer don with 
10, 40, and 100 ng of pBR322 (plus insert), each transformation in 100 iiL of competent cells. The cells 
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were plated as describ d in Exam pi 111 xcept the antibiotic selection was 50 ug/mL of ampicillin. The 
clones were transferred to nitrocellulose, replicated and prepared for hybridization as described in Example 
III. The filters were probed with the appropriate nick-translated cDNA fragment. Streak-purified colonies 
which were positive in the hybridization were used to prepare additional plasmid, as follows. 

s The plasmid bearing LE392 E. coli was grown to an ODsoo of 1.0 in 1x LB media containing 50 ug/mL 
of ampicillin and amplified overnight by the addition of chloramphenicol to a final concentration of 200 
ug/mL. The cells were washed in 0.8% NaCI, 20 mM Tris, (pH 8.0) 20 mM EDTA, then lysozome treated in 
25% sucrose, 50 mM Tris, (pH 7.4) and 20 mM E~DTA with 450 Rg/mLlysozyme. Lysis was achieved by 
adding 5 M NaCI to" a final concentration of 2U M followed by the addition of an equal volume of 0.2% 

w Triton X-100 and 40 mM EDTA. The preparation was cleared by spinning at 20,000 RPM for 45 minutes. 
The supernatant was then phenol/chloroform/isoamyl alcohol extracted, chloroform/isoamyl alcohol extracted 
and EtOH precipitated. The pellet was resuspended in TE buffer, RNase A treated, 
phenol/chloroform/isoamyl alcohol extracted and chloroform/isoamyl alcohol extracted. Solid CsCI was 
added to give a final concentration of 800 ug/mL plus EtBr was added to give a final concentration of 1 

75 mg/mL The resulting solution was spun in a Vti 50 rotor at 49,000 RPM for 18-20 hours at 20* C. 

The plasmid band was visualized by UV fluorescence and drawn from the tube using a needle and 
syringe. The plasmid solution was n-butanol extracted four times and ethanol precipitated at -20 * C. The 
ethanol precipitation was repeated at least twice to remove all of the CsCI. The plasmid was stored at 
-20 * C as an ethanol precipitate. 

20 

EXAMPLE VII 

Purification Of Alcohol Oxidase 

25 Protein samples from Pichla pastoris cells grown on methanol as described in Example I were 
prepared by lysis of yeast cells, followed by a clearing spin to remove cell debris, as follows: A portion of 
the fermenter effluent was removed and adjusted to pH 7.5 with ammonium hydroxide, and was homog- 
enized on a Dyno-Mill Model KDL using a 0.6 liter vessel in a continuous operation at 30 *C. using belt 
combination #3 and a flow of 20-30 mUhr. The beads in the mill were lead free glass beads with a diameter 

30 of 0.3-0.5 mm. The resulting homogenate was centrifuged at 5*C. and 20,000Xg for 30 minutes to yield a 
cell-free supernatant. 

Six 130 mL portions of the cell-free supernatant were placed in cellulose acetate dialysis bags and 
dialyzed at 5 * C. against about 8 liters of distilled water. After 4 days, the aqueous phase of each bag was 
decanted. The solids remaining in the bags consisted of two types of solid. The thin upper white layer was 

35 carefully removed and discarded. The bottom solid was brown-yellow and was crystalline alcohol oxidase. A 
portion of the crystalline alcohol oxidase was dissolved in distilled water (about 10 times the volume of the 
solid) and an assay by the dye-peroxidase method showed an activity of 94 EU/mL. The specific activity of 
the alcohol oxidase was 10.4 EU/mg of protein. 

The crystalline precipitate resulting from the above-described dialysis was dialyzed against 0.05 M 

40 potassium phosphate buffer (pH 7.5), and applied to a 2x 200 cm Sephacryl 200 (Pharmacia) column 
equilibrated with the same buffer. Fractions of 3.5 mL were collected at a flow rate of 10 mL/hr and assayed 
for alcohol oxidase activity. 

The alcohol oxidase activity for reaction with methanol was determined by the following assay 
procedure (dye-peroxidase method). A dye-buffer mixture was prepared by mixing 0.1 mL of an o- 

45 dianisidine solution (1 weight % o-dianisidine in water) with 12 mL of aerated 0.1 M sodium phosphate 
buffer (pH 7.5). The assay mixture was prepared with 2.5 mL of the dye-buffer mixture, 50 nL of methanol, 
10 uL of a peroxidase solution (1 mg of horse-radish peroxidase-Sigma, Type II), and 25 uL of the alcohol 
oxidase solution. The assay mixture was maintained at 25*C in a 4x1x1 cm cuvette and the increase in 
absorbance by the dye at 460 nm was recorded for 2 to 4 minutes. The enzyme activity was calculated by 

50 

AA 

x 11 5 

Activity (p mole/min/mL or Enzyme Units/mL) = min 

55 

wher in 11.5 is a factor based on a standard curv prepar d with known aliquots of H2O2 and AA is th 
change in absorbance during the experimental int rval. 

A total of 0.1 ug of total prot in from ach fraction was also assayed for alcohol oxidas content by gel 
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electrophoresis with SDS-polyacrylamide (12%). 

EXAMPLE VIII 

5 DNA And Protein Sequencing 

Determination of DNA sequences was performed by the dideoxy chain elongation method using 
bacteriophage M13 (Sanger et a), 1980) or by the chemical modification method (Maxam and Gilbert, 1960). 
The DNA fragments corresponding to the 5' end of the alcohol oxidase gene were inserted into the 

io Ml3mp8 and Ml3mp9 yectors or end-labelled for the chemical modification method using restriction 
enzyme sites available in this region. 

The 710bp Hin6\WSa/i fragment from pPG 4.0 was end-labelled for Maxam -Gilbert sequencing by first 
digesting 33 ug of the plasmid with Hfnd\\\. The reaction mixture was phenol/chloroform/isoamyl alcohol 
extracted, chloroform/isoamyl alcohol extracted and ethanol precipitated. The DNA was collected by 

T5 centrifugation and resuspended in 31 uL of water. 100 aCi of ^P-a-dCTP (3200 Ci/mmol) and 2 units of 
Klenow fragment DNA Poll was added to the reaction mixture to give a final volume of 50 uL containing 400 
UM dATP, 400 uM dGTP, 50 mM Tris, (pH 7.4), 10 mM MgSOi. and 1 mM DTT. The reaction mixture was 
incubated at 37 *C for 1 hour and stopped by the addition of 2 uL of 0.5~M EDTA. The mixture was then 
phenol/chloroform/isoamyl alcohol extracted, chloroform/isoamyl alcohol extracted, chromatographed on a 

20 Sephadex G-50 column, and the labelled nucleic acid fractions pooled and ethanol precipitated. After 
centrifugation, the DNA pellet was resuspended in water and digested with Sa/I. The digest was elec- 
trophoresed on a 1% agarose gel in TAE buffer, and the 710 bp band was cut from the gel, the DNA 
electroeluted, butanol extracted, and ethanol precipitated. The fragment was resuspended in 100 uL of TE 
buffer, adjusted to 2.5 M ammonium acetate and then ethanol precipitated. The resulting DNA fragment was 

25 resuspended in TE buffer at a concentration of 50,000 cpm/uL. 

The four base modification reactions were performed as follows: (a) the G (guanine) reaction was 
incubated for 8 minutes at 22 • C and contained 1 uL (50,000 cpm) of the labelled DNA fragment, 4 uL of 
water, 200 uL of 50 mM sodium cacodylate, pH 8.0, 1 mM EDTA (DMS buffer) and 1 uL dimethyl sulfate. 
The reaction was terminated by the addition of 50 uL of DMS stop buffer containing 1.5 M sodium acetate, 

30 (pH 7.0), 1 M 2-mercaptoethanol and 100 ug/mL tRNA, then ethanol (750 uL) was added - and the reaction 
mixture was~held at -70 *C for at least 15 minutes, (b) the G/A (guanine/adenine) reaction was incubated for 
10 minutes at 22*C and contained 2 uL (100,000 cpm) of the labelled DNA fragment, 8 uL of water and 30 
uL of formic acid. The reaction was terminated by the addition of 200 uL of Hz stop buffer (0.3 M sodium 
acetate, pH 5.5, 0.1 M EDTA and 25 ug/mL tRNA), then ethanol (750 uL) was added and the reaction 

35 mixture held at -70*C~7or at least 15 minutes, (c) the T/C (thymine/cystosine) reaction was incubated for 10 
minutes at 22* C and contained 2 uL (100,000 cpm) of the labelled DNA fragment, 18 uL of water and 30 
uL of hydrazine. The reaction was terminated as described in (b) above, (d) the C (cytosine) reaction was 
incubated for 10 minutes at 22 *C and contained 1 uL (50,000 cpm) of the labelled DNA fragment, 4 uL of 
water, 15 uL of 5 M NaCI, and 30 uL of hydrazine. The reaction was terminated as described in (b) above. 

40 The DNA pellets were collected by centrifugation, resuspended in 250 uL of 0.3 M sodium acetate, pH 
5.5 and ethanol precipitated with 750 uL of 95% ethanol. The pellets were collected by centrifugation, dried 
under vacuum for 5 minutes, and the DNA cleaved by resuspending the pellets in 100 uL of a 1 to 10 (v/v) 
dilution of piperidine. The cleavage reaction was incubated at 90 * C for 30 minutes and terminated by the 
addition of 500 uL of 98% ethanol, 60 mM sodium acetate (pH 5.5) and 10 ug/mL tRNA. The reaction 

45 mixture was placed in a dry-ice/ethanol bath (about -70 * C) for about 5 minutes and the DNA fragments 
were collected by centrifugation. The fragments were resuspended in 50 uL of 0.3 M sodium acetate (pH 
5.5) and then ethanol precipitated with 100 lit of 95% ethanol. This ethanol precipitation was repeated, the 
pellets were washed with 95% ethanol and evaporated under vacuum during centrifugation. The pellet was 
resuspended in 10 uL of 80% formamide, 10 mM NaOH, 1 mM EDTA, 0.1% xylene cyanol and 0.1% 

so bromphenol blue. Two to three uL were electrophoTesed on a 10%~0.4 mm thick polyacrylamide gel in TBE 
buffer. 

The amino acid sequence of alcohol oxidase was determined by Sequemat, Inc. (Watertown, Mass.) 
using 2 mg of purified alcohol oxidase from Pichia pastorls. The first 18 amino acids of the mature protein 
were determined to be: 

65 
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Ala-Ile-Pro-Glu-Glu-Phe-Asp-Ile-Leu-Val-Leu-Gly-Gly-Gly-Ser- 
Gly-Ser . 

5 

EXAMPLE DC 

Determination Of Alcohol Oxidase Transcriptional Initiation Site 

10 

To determine where the start of the mRNA for alcohol oxidase was located, a primer extension 
experiment was performed using a synthetic oligonucleotide copied from the DNA sequences of the 5' end 
of the alcohol oxidase gene as primer and 10 ug of poly A+ Pichia past oris mRNA as template. Ten ug of 
Pichia pastoris poly A+ mRNA was combined with 3 ng of primer (S'-CTT CTC AAG TTG TCG-3*) in a 

75 final volume of 9.3 uL which was 43 mM NaCI, 29.2 mM Tris (pH 8.3), 5.2 mM MgCfe and 4.3 mM DTT. 
The nucleic acids were denatured at 70" C for 5 minutes and reannealled by~ allowing to slowly "cool to 
22 • C. The reannealling mix was added to a tube containing 4 uL of dNTP mix, 0.8 uL RT buffer, and 1 uL 
^P-a-dCTP (800 Ci/mmol). Three uL of this mixture was added to 1 uL of each respective dNTP. The final 
3 uL in the mixture was added to 1 uL of water. The reactions were started by the addition of 1 uL of dil 

20 RT and incubated at 42 *C for 15 minutes. The reactions were chased with 3 uL of Chase RT at 42' C for 
15 minutes. The reactions were stopped by the addition of 7.5 uL formamide dye mix and 4-5 uL were 
electrophoresed on a 0.4 mm thick gradient gel in 1x TBE. After electrophoresis the gel was fixed in 10% 
acetic acid with 10% methanol for 20 minutes. The gel was dried under vacuum and. used to expose an 
XAR X-ray film. 

25 The gradient gel was prepared as follows: 300 uL of 10% ammonium persulphate and 14 uL of TEMED 
were added to 30 mL of top gel; 75 uL of 10% ammonium persulfate and 3.5 uL TEMED were added to 7 
mL of bottom gel, 6 mL of top gel were drawn up into a pipet and then 6 mL of bottom gel were drawn into 
the same pipet. The gel was poured between the gel plates followed by 22 mL of top gel. 

30 EXAMPLE X 

mRNA Hybridization-Selection And In Vitro Translations 

Positive hybridization-translation experiments were performed by linearizing twenty ug of cloned Pichia 

35 genomic DNA (prepared as described in Example VI) by digestion with various restriction endonucleases. 
This DNA was denatured by making the solution 0.3 M NaOH and incubating at 65 *C for 10 minutes. The 
denatured DNA-containing solution was quickly chilled on ice and neutralized by adjusting to 0.5 M 
Tris*HCI (pH 7.4). An equal volume of 20x SSPE was added to the denatured DNA immediately prior to 
binding the DNA to the nitrocellulose filters. Prior to applying the DNA to the nitrocellulose filters (Schleicher 

40 and Schuell BA83, 9 mm dia.), the filters were prepared by wetting with H 2 0, boiling for 10 minutes and 
rinsing three times in 10x SSPE. Ten ug of DNA was then bound to each filter by applying the DNA to the 
filter, air drying and finally drying the filters under vacuum at 70 * C for 2 hours. 

Prior to prehybridization, the filters with the bound DNA were placed in 1 mL of sterile water and heated 
for one minute at 100*C, cooled on ice, rinsed by vortexing in 1 mL of sterile water and rinsed with 1 mL of 

45 prehybridization buffer. The filters were prehybridized in 1 mL of prehybridization buffer, then 40 ug 
(2ug/mL ETS) of poly A+ mRNA was added directly to the prehybridization buffer. The hybridization 
mixture was heated at 65* C for 10 minutes and then incubated at 42* C for 24 hours. 

Following the hybridization, filters were washed briefly 2 times in 1x SSPE which contained 0.5% SDS 
at 22* C, 7 times in 1x SSPE which contained 0.5% SDS at 50* C for 5 minutes each, 3 times in 0.1 x SSPE 

so at 50* C for 5 minutes each, and once in 0.1x SSPE at 65* C for 10 minutes. The RNA was eluted from the 
filters by boiling for 1 minute in 300 uL of hfeO containing 15 ug of rabbit liver tRNA. The eluted RNA was 
quickly frozen In a dry-ice ethanol bath. The RNA mixture was allowed to warm to room temperature and 
the filters removed. The RNA mixture was then precipitated by adjusting the medium to 2.5 M ammonium 
acetate and precipitating with ethanol 2 times, and finally resuspended in 100 uL of H2O before being 

55 lyophilized. 

Translations wer performed according to standard techniques known by those of skill in the art, such 
as for example, instructions provided by New England Nuclear in vitro rabbit reticulocyte lysate translation 
kits. Protein products wer lectrophoresed on 8% poly aery lam ide gels containing a 4.5% stacking get. 
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EXAMPLE XI 

Antisera Preparations And Immunoprecipitations 

5 Antisera raised in rabbits against an extract from Pichia pastoris cells containing both p72 (alcohol 
oxidase) and p76 polypeptides were prepared using standard protocols. Extracts were dialyzed against PBS 
before injections. Over a course of 8 weeks, each rabbit received 3 injections each of which consisted of 1 
mg of total protein in a volume of 0.1 mL with 0.2 mL of Freunds complete adjuvant. Injections were 
intradermal^ delivered to 6-10 sites in the rabbit. At the end of eight weeks, the rabbits were bled, and their 

10 sera tested against purified Pichia pastoris alcohol oxidase by the Ouchterlony double diffusion procedure. 
Purified rabbit anti-p72 (alcohol oxidase) and anti-p76 protein antibodies were prepared by affinity 
chromatography of whole antisera through a CNBr coupled p72 (alcohol oxidase)-p76 Sepharose 4B column 
(Pharmacia). One gram of CNBr activated gel was prepared by hydrating the gel for 15 minutes in 200 mL 
of 1 mM HCI followed by 3x 50 mL washes in coupling buffer (0.1 M sodium carbonate (pH 8) and 0.5 M 

75 NaCI). Five mL of a 6 mg/mL solution of p72-p76 in coupling buffer was added to the gel and gently 
agitated overnight at 4*C. Unbound protein was removed by washing 3x 50 mL with coupling buffer. The 
remaining active groups were eliminated by a 2-hour incubation in 1 M ethanolamine (pH 8). Non-covalently 
bound material was removed from the gel by a 50 mL wash with 2 M sodium thiocyanate in PBS. Prior to 
chromatography of the antisera, the gel was finally washed with 3x 50 mL of PBS. Five mL of clarified anti 

20 p72-p76 antisera were mixed with the gel and incubated with gentle agitation for 2 hours at 4*C. The 
antisera-gel mixture was then pipetted into a 1x 8 cm column and washed with 150 mL of PBS. Purified 
antibody was eluted from the column with 6 mL of 2 M sodium thiocynate in PBS. After elution from the gel, 
the purified antibody was dialyzed extensively againstPBS which contained 0.02% sodium azide. 

The affinity-purified antisera was added to an in vitro translation mix in PBS, 1 % NP40 and incubated 

25 overnight at 4 • C. The antibody-antigen complex was precipitated with Pansorbin (Calbiochem) on ice for 
2.5 hours. Pansorbin was prepared by washing in RIPA buffer. Pansorbin precipitates were washed 4 times 
in RIPA buffer and dissolved in Laemmli loading buffer before electrophoresis. 

EXAMPLE XII 

30 

Pichia pastoris Transformation Procedure 

A. Cell Growth 

35 1. Inoculate a colony of Pichia pastoris GS115 (NRRL Y-1 5851) into about 10 mL of YPD medium and 
shake culture at 30 • C for 1 2-20 hrs. 

2. After about 12-20 hrs., dilute cells to an ODeoo of about 0.01-0.1 and maintain cells in log growth 
phase in YPD medium at 30 • C for about 6-8 hrs. 

3. After about 6-8 hrs, inoculate 100 mL of YPD medium with 0.5 mL of the seed culture at an ODeoo of 
40 about 0.1 (or equivalent amount). Shake at 30 * C for abour 12-20 hrs. 

4. Harvest culture when ODeoo is about 0.2-0.3 (after approximately 16-20 hrs) by centrifugation at 1500 
g for 5 minutes. 

B. Preparation of Spheroplasts 

4$ 

1 Wash cells once in 10 mL of sterile water. (All centrifugations for steps 1-5 are at 1500 g for 5 
minutes.) 

2. Wash ceils once in 10 mL of freshly prepared SED. 

3. Wash cells twice in 10 mL of sterile 1 M Sorbitol, 
so 4. Resuspend cells in 10 mL SCE buffer" 

5. Add 5-10 uL of 4 mg/mL Zymolyase 60,000 (Miles Laboratories). Incubate cells at 30 *C for about 30- 
60 minutes. 

Since the preparation of spheroplasts is a critical step in the transformation procedure, one should 
monitor spheroplast formation as follows: add 100 uL aliquots of cells to 900 lit of 5% SDS and 900 uL 
55 of 1 M Sorbitol befor or just after th addition of zymolyase and at various times during th incubation 
period! Stop the incubation at the point where cells lyse in SDS but not in sorbitol (usually between 30 
and 60 minutes of incubation). 

6. Wash spheroplasts twice in 10 mL of sterile 1 M Sorbitol by centrifugation at 1000 g for 5-10 minutes. 
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(The time and speed for centrifugation may vary; centrifuge enough to pellet spheroplasts but not so 
much that they rupture from the force.) 

7. Wash cells once in 10 mL of sterile CaS. 

8. Resuspend cells in total of 0.6 mL of CaS. 

C. Transformation 

1. Add DNA samples (up to 20 uL volume) to 12 X 75 mm sterile polypropylene tubes. (DNA should be 
in water or TE buffer; for maximum transformation frequencies with small amounts of DNA, it is advisable 
to add about 1 u,L of 5 mg/mL sonicated E. coll DNA to each sample.) 

2. Add 100 uL of spheroplasts to each DNA sample and incubate at room temperature for about 20 
minutes. 

3. Add 1 mL of PEG solution to each sample and incubate at room temperature for about 15 minutes. 

4. Centrifuge samples at 1000 g for 5-10 minutes and decant PEG solution. 

5. Resuspend samples in 150 uL of SOS and incubate for 30 minutes at room temperature. 

6. Add 850 uL of sterile 1 M Sorbitol and plate aliquots of samples as described below. 

D. Regeneration of Spheroplasts 

1 . Recipe for Regeneration Agar Medium: 

a. Agar-Sorbitol- 9 g Bacto-agar, 54.6 g Sorbitol, 240 mL H 2 O p autoclave. 

b. 10X Glucose- 20 g Dextrose, 100 mL H 2 0, autoclave. 

c. 10X SC- 6.75 g Yeast Nitrogen Base without amino acids, 100 mL H 2 0, autoclave. (Add any 
desired amino acid or nucleic acid up to a concentration of 200 ug/mL before or after autoclaving.) 

d. Add 30 mL of 10X Glucose and 30 mL of 10X SC to 240 mL of the melted Agar-Sorbitol solution. 
Add 0.6 mL of 0.2 mg/mL biotin and any other desired amino acid or nucleic acid to a concentration 
of 20 ug/mL. Hold melted Regeneration Agar at 55-60" C. 

2. Plating of Transformation Samples: 

Pour bottom agar layer of 10 mL Regeneration Agar per plate at least 30 minutes before 
transformation samples are ready. Distribute 10 mL aliquots of Regeneration Agar to tubes in a 45-50 # C 
bath during the period that transformation samples are in SOS. Add aliquots of transformation samples 
described above to tubes with Regeneration Agar and pour onto bottom agar layer of plates. Add a 
quantity of each sample to 10 mL aliquots of melted Regeneration Agar held at 45-50*0 and pour each 
onto plates containing a solid 10 mL bottom agar layer of Regeneration Agar. 

3. Determination of Quality of Spheroplast Preparation: 

Remove 10 uL of one sample and dilute 100 times by addition to 990 UL of 1 M Sorbitol. Remove 
10 uL of the 100 fold dilution and dilute an additional 100 times by addition to a second 990 uL aliquot 
of 1 M Sorbitol. Spread plate 100 uL of both dilutions on YPD agar medium to determine the 
concentration of unspheroplasted whole cells remaining in the preparation. Add 100 uL of each dilution 
to 10 mL of Regeneration Agar supplemented with 40 ug/mL histidine to determine total regeneratable 
spheroplasts. Good values for a transformation experiment are 1-3 X 107 total regeneratable 
spheroplasts/mL and about 1 X 10 3 whole cells/mL 

4. Incubate plates at 30* C for 3-5 days. 

Example XIII 

Isolation Of Pichia Pastoris HIS4 Gene And Autonomous Replication Sequences 

A. Strains 

The strains employed include: 

(a) Pichia pastoris strain NRRL Y-11430; 

(b) Pichia pastoris GS1 15 (his4; NRRL Y-15851); 

(c) S. cerevisiae strain 5799-4D (a /j/s4-260 /)/s4-39; NRRL Y-15859); and 

(d) E. coll strain 848 (F~ met thi gal Ti R 4>8Qp hsdFT hsdM*). 

B. Plasm ids 
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pYA2 (se Figure 23; consists of the S. cerevlsiae HIS4 gene on a 9.3 kbp Psfl fragment inserted at 
the Psfl site of pBR325) was the source of the S. cerevisiae HIS4 gene fragments and has been deposited 
in an E. coli host and is available to the public as NRRL B-15874. 

YEp13 is available from the American Type Culture Collection and has been assigned accession 
5 number ATCC 371 1 5. 

C. Media 

Pichia pastoris was^grown in YPD (rich) or IMG (minimal) media. IMG, a minimal medium, consists of 
io the following: 

1. IMi Salts at a final concentration of 36.7 mM KH2PO4. 22.7 mM (NH+feSOi. 2.0 mM MgS04*7H 2 0, 
6.7 mM KCI, 0.7 mM CaCl2*2H 2 0 ( prepared as a 1 0x stock solution and autoclaved; 

2. Traoe Salts at a~final concentration of 0.2 uM CuSCVSI-feO, 1.25 uM Kl, 4.5 uM MnSC>4*H 2 0, 2.0 
UM NaMo04*2H 2 0, 0.75 uM H3BO3, 17.5 uM ZnS04*7H 2 0. 44.5 uM FeCl3*6H 2 0. prepared as a 400x 

75 stock solution and filter sterilized; 

3. 0.4 ug/mL biotin; and 

4. 2% dextrose. 

E. coif was cultured in either LB medium or 2B medium (0.2% NH+PO+, 1.2% Na 2 HP04. 0.013% 
MgSO+"7H 2 0. 0.074% CaCI 2 *2H 2 0, 1 ug/mL thiamine and 0.4% dextrose) supplemented with 100 ug/mL 
20 tryptophan, and 0.2% Casamino acids. 

D. DNA Isolation 

1 . Large Scale Preparations of Yeast DNA. 

25 

Both Pichia pastoris and S. cerevisiae DNA preparations were carried out by growing yeast cells in 
100 mL of minimal medium until Aeoo equals 1-2 and then harvesting the cells by centrifugation at 2,000 g 
for 5 minutes. The cells were washed once in H 2 0, once in SED, once in 1 M sorbitol and then suspended 
in 5 mL of 0.1 M Tris-HCI (pH 7.0) which is 1 M in sorbitol. The cells were~mixed with 50-100 uL of a 4 

30 mg/mL solution of Zymotase 60,000 (Miles Laboratories) and incubated at 30" C for 1 hour to digest the cell 
walls. The spheroplast preparation was then centrifuged at 1000 g for 5-10 minutes and suspended in Lysis 
buffer (0.1% SDS, 10 mM Tris-HCI, (pH 7.4), 5 mM EDTA and 50 mM NaCI). Proteinase K (Boehringer 
Mannheim) and RNase A (Sigma) were each added to 100 ug/mL and the mixtures incubated at 37 *C for 
30 minutes. DNA was deproteinized by gently mixing the preparation with an equal volume of chloroform 

35 containing isoamyl alcohol (24:1, v/v) and the phases were separated by centrifugation at 12,000 g for 20 
minutes. The upper (aqueous) phase was drawn off into a fresh tube and extracted with an equal volume of 
phenol/chloroform/isoamyl alcohol. The phases were separated as before and the top phase placed in a 
tube containing 2-3 volumes of cold 100% ethanol. The sample was gently mixed and DNA was collected 
by spooling onto a plastic rod. The DNA was immediately dissolved in 1 mL of TE buffer and dialyzed 

40 overnight at 4* C against 100 volumes TE buffer. 

2. Small Scale Yeast DNA Preparations . 

Five mL of yeast cultures in minimal medium were grown until Aeoo equals 1-5 and harvested by 
45 centrifugation at 2,000 g for 5 minutes. Cells were suspended in 1 mL of SED and transferred to a 1 .5 mL 
microfuge tube, washed once in 1 M sorbitol and resuspended in 0.5 mL of 0.1 M Tris-HCI (pH 7.4) which is 
1 M in sorbitol. Zymolase 60,000 (10 uL of a 4 mg/mL solution) was added and the cells were incubated for 
30-60 minutes at 30 "C. Cells were then centrifuged for 1 minute, suspended in the Lysis buffer and 
incubated at 65-70 *C. After 15 minutes the samples were mixed with 100 itL of 5 M potassium acetate, 
50 held in an ice bath for 15 minutes and centrifuged for 5 minutes. The supematants were decanted into a 
fresh microfuge tube containing 1 mL of 100% ethanol, mixed and centrifuged for 10 seconds. Finally, the 
DNA pellets were air dried for 10-15 minutes and dissolved in 50 uL of TE buffer. 

3. Large Scale E. coii DNA Isolations . 

55 

E. coli cultures for large scale (0.5-1 L) plasmid preparations were grown at 37 *C with shaking in 2B 
medium supplemented as described abov and with the appropriate antibiotic. For cells which contained 
pBR322 derived plasmids, cultures were grown to an Asso of about 0.7 at which time sufficient chloram- 
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phenicot was added to give a concentration of 100 ug/mL and cells harvested approximately 15 hours later. 
Strains which contained pBR325 derived plasmids were inoculated into the supplemented 2B medium at a 
starting Asso of about 0.01-0.05 and incubated with shaking at 37 "C for 20-24 hours before harvesting. 

5 4. Small Scale E. coli DNA Preparations . 

For small scale rapid plasmid isolations, 2 ml_ cultures in the supplemented 2B medium with antibiotic 
were grown overnight at 37 *C with shaking and harvested by centrifugation in 1.5 mL microfuge tubes. 
Plasmids from all preparations were isolated by the alkaline lysis method described by Birnboim and Doly 
10 (1979). 

E. Restriction DNA and Fragment Isolation. 

Restriction enzymes were obtained from New England Biolabs and Bethesda Research Laboratories 
75 and digestions were performed by routine techniques. Restriction mappings were carried out by comparing 
parallel digestions of plasmids with and without insert DNA. Restriction fragments were purified by 
electroelution from agarose gels into Whatman 3 MM paper strips backed by dialysis tubing. The fragments 
were recovered from the paper and tubing by 3-4 washings with 0.1-0.2 mL volumes of a solution which 
contained 0.1 M NaCI. 50 mM Tris-HCI (pH 8.0) and 1 mM EDTA. Finally, the fragments were extracted with 
20 phenol/chloroform/tsoamyl alcohol, precipitated with ethanol and redissolved in a small volume of TE buffer. 

F. P. pastorls Library Construction in E. Coll. 

For the Pichia pastoris DNA-YEpl3 library construction, 100 ug of YEpl3 was digested to completion 

25 with BamHi and treated with calf intestinal alkaline phosphatase to remove the terminal 5* phosphate from 
the DNA. A 100 ug aliquot of wild type Pichia pastoris DNA from Pichia pastoris NRRL Y-11430 was 
partially digested with 10 units of Sau3A I by incubation for 5 minutes at 37* C in a total volume of 1 mL. 
Fragments of 5 to 10 kbp were size selected by centrifugation through 5-20% sucrose gradients. One ug of 
the vector and 2 ug of the Pichia Sau3A I fragments were mixed with 20 units of T4 DNA ligase (Bethesda 

30 Research Laboratories) in a total volume of 200 uL and incubated overnight at 4*C. The ligated DNAs were 
transformed into E. coll by adding the entire ligation reaction mix to 2 mL of competent E. coll 848 cells 
and incubating for 15 minutes at 0*C. The mixture was warmed to 37 *C for 5 minutes after which time 40 
mL of LB medium was added and the 37 *C incubation continued for another 1 hour. Ampicillin was then 
added to give a total concentration of 100 ug/mL and the incubation continued for a second hour. Finally. 

35 the cells were centrifuged for 10 minutes at 3,000 g, resuspended in 1 mL of fresh LB medium and spread 
in equal aliquots on 10 LB agar plates containing 100 ug/mL of ampicillin. The approximately 50,000 
colonies which resulted were scraped from the plates and a portion of the cells was inoculated into 500 mL 
of the supplemented 2B medium at a starting Asso of about 0.1. The culture was grown and plasmid was 
extracted as described above. Of the colonies which were pooled for the library, 98 out of 1 00 tested were 

40 tetracycline sensitive and 7 out of 10 examined contained plasmids with insert DNA. 

For the Pichia pastoris DNA-pYJ8A C/a library construction, 50 ug of pYJ8AC/a was digested to 
completion with C/al and treated with calf intestinal alkaline phosphatase to remove the terminal 5' 
phosphate from the DNA. A 100 ug aliquot of DNA from Pichia pastoris NRRL Y-15851 was partially 
digested with 20 units of Taq\ by incubation for 5 minutes at 65 * C in a total volume of 1 mL. Fragments of 

45 5 to 10 kbp were size selected by electroelution from a 0.5% agarose gel (See Example II, Section E). One 
ug of the vector and 2 ug of the Pichia Taq\ fragments were mixed with 20 units of T4 DNA ligase 
(Bethesda Research Laboratories) in a total volume of 200 uL and incubated overnight at 4"C. The ligated 
DNAs were transformed into E. coli by adding the entire ligation reaction mix to 2 mL of competent £F. coli 
848 cells and incubating for 15 minutes at 0* C. The mixture was warmed to 37* C for 5 minutes after which 

50 time 40 mL of LB medium was added and the 37* C incubation continued for another 1 hour. Ampicillin was 
then added to give a total concentration of 100 ug/mL and the incubation continued for a second hour. 
Finally, the cells were centrifuged for 10 minutes at 3,000 g, resuspended in 1 mL of fresh LB medium and 
spread in equal aliquots on 10 LB agar plates containing 100 Ug/mL of ampicillin. The approximately 10,000 
colonies which resulted were scraped from the plates and a portion of the cells was inoculated into 500 mL 

55 of th supplemented 2B medium at a starting Asso of about 0.1. The culture was grown and plasmid was 
extracted as described above. 

G. South m Hybridizations. 
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For transfer of large or supercoiled DNA molecules to nitrocellulose, DNA was first partially hydrolyzed 
by soaking agarose gels in 0.25 M HCI for 10 minutes prior to alkali denaturation. The hybridization of 
labelled fragments from the $. cerevisiae HIS4 gene to Plchfa pastoris DNA was performed in the 
presence of 50% formamide. 6x SSC, 5x Denhardfs. 0.1% SDS. 1 mM EDTA. and 100 ug/mL denatured 
5 herring sperm DNA at 42 'C. Post-hybridization washes were in 1x SSC, 1 mM EDTA, 0.1% SDS and 1.0% 
sodium pyrophosphate at 65 * C. DNA was ^P-labelled as described in Example IV. 

H. DNA Sequencing . 

io DNA sequencing was by the dideoxynucleotide chain termination method of Sanger ef ai (1980). 

I. Yeast Transformations 

5. cerevisiae transformations were carried out by the spheroplast generation method of Hinnen ef a/ - 
T5 (1978). 

Pichia pastoris transformations were performed following the procedure described above. 
J. Analysis of Pichia pastoris Transformants 

20 The ability of each plasmid to be maintained as an autonomous element in Pichia pastoris cells was 
determined as follows: A transformant colony was picked from the regeneration agar plate and streaked 
onto an SD medium agar plate and inoculated into liquid IMG medium. The SD plate was incubated at 
30 * C for 3 days after which time a single colony was picked from this plate, streaked onto a second SD 
plate and inoculated into a second flask of IMG medium. This process was repeated a third time. The 3 IMG 

25 cultures were grown at 30*0 with shaking to an Agoo of about 1-2 and then harvested by centrifugation. 
DNA from the yeast cultures was extracted as described above, electrophoresed at 30 Volts and 30 mAmps 
for 10-15 hours into 0.8% agarose gels, transferred to nitrocellulose and hybridized to ^P-labelled pBR322 
or pBR325 as described above. As controls, a sample containing 10 ng of plasmid isolated from E. coii and 
a sample containing 1-2 ug of untransformed Pichia pastoris GS115 DNA were electrophoresed in parallel 

30 with the experimental samples. 

K. Isolation of Pichia DNA Sequences. 

DNA fragments which contained the Pichia HIS4 gene were isolated from a Pichia DNA library by their 
35 ability to complement S. cerevisiae hfs4 strains. The library was composed of 5-20 kbp SaU3A\ partial 
digestion fragments of wild type Pichia DNA inserted into the BamH\ site of the S. cerevlslae-E. coil 
shuttle vector YEp13. Spheroplasts of S. cerevisiae NRRL Y-15859 (5799-4D; a /)/$4ABC~ strain) were 
generated by the technique of Hinnen ef al (1978), mixed with the Pichia DNA library and allowed to 
regenerate in a medium deficient in histidine. The transformation resulted in about 1x10 3 prototrophic yeast 
40 colonies from a population of 5x10 7 total regeneratable spheroplasts. Total yeast DNA was extracted from 
20 of the His + colonies and transformed into E. coif. Seventeen of the yeast DNA preparations produced 
ampicillin resistant colonies and each contained plasmid comprised of YEp13 plus insert DNA. To confirm 
that the His+ transforming plasmids contained the Pichia HIS4 gene and not a DNA fragment with 
suppressor activity, restriction digests of the plasmids were hybridized to a labelled DNA fragment 
45 containing a large portion of the 5. cerevisfae HIS4 gene and washed at low stringency. Each of the 
plasmids which complemented the hls4 S. cerevisfae strains contained sequences which hybridized to the 
S. cerevisfae HIS4 gene. 

To search for DNA fragments which contain Pichia ARS activity, DNA from Pichia pastoris GS115 
(NRRL Y-15851) was partially digested with TaqX and 5 to 10 kbp fragments were isolated and cloned into 

so the unique C/al site of pYJ8AC/a. (See Figure 26). Plasmid DNA was recovered from about 10,000 His + 
Pichia colonies and used to transform E. coll Plasmids from about 10,000 ampicillin resistant colonies were 
isolated and then transformed back into GS115. Forty of the His+ yeast colonies from this sublibrary 
transformation were separately streaked onto selective medium and grown in separate cultures in selective 
medium. Total yeast DNA was extracted from each of these 40 cultures and transformed into E. cofi. Two 

ss plasmids, pYA63 (PARS1) and pYA90 (PARS2) whose yeast DNA preparations produced the most ampicillin 
resistant E. coif colonies, were s I cted for furth r analysis. Both of th se plasmids transformed Pichia 
pastoris GS115 (NRRL Y-15851) at a very high frequency and each contained an insert of foreign DNA. 
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EXAMPLE XIV 

Construction Of Regulatory Region-/acZ Gene Fusions 

5 A. p72 (Alcohol Oxidase) Regulatory Region Construc ts 

Plasmid pPG 4.0, a pBR322 vector which contains the 4 kilobase pair EcoRI-Pw/ll genomic DNA 
fragment from Pichia pastoris was cut with Fsfl. treated with S1 nuclease to produce blunt ends where 
cleavage occurred, then cut with flamHI to give a 1.12 kbp DNA fragment which contains the alcohol 
10 oxidase regulatory region and the coding information for the first 1 5 amino acids of alcohol oxidase. This 
1.12 kbp DNA fragment has the following nucleotide sequence: 



75 1.12 kbp I 

"Si nuclease" CTA GGT GGT G 

treated end GAT CCA CCA CCT AG. 

t 
B 

20 alcohol oxidase leu lx gly X 2 glyi3 



This 1.12 kbp was ligated into the EcoR\f$maVBamH\ linker (cleaved with BamH\ and Sma\) of the E. colLS. 
cerev/siae shuttle vector pSEY101, Douglas ef a/ (1984). Vector pSEY101 contains the E. coli /acZ gene 
25 and a polylinker with the following nucleotide sequence: 

R x Sm B 

30 . . . .GAATTCCCGGGGATCCC GTC GTT 

CTTAAGGGC C CC TA GGG CAG CAA 

t f t 

R x Sm B Valg Val x Q . - p -galactosidase 



35 



40 



to give hybrid plasmid pTA01 1 (See Figure 29). 

Since the regulatory region-/acZ gene fusion of pTA011 is out of phase with respect to production of 0- 
galactosidase as shown in Sequence E: 



45 



so 



55 
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Vector pTA01 1 is cleaved at the unique flam HI site and the following Sma\ linker inserted: 

55 
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Sm 

GATCACCCGGGT 
T G G G.C CCA 

t 

Sm 



C T A G 



thus producing hybrid vector pTA012, which has the following nucleotide sequence with respect to the 
regulatory region-/acZ fusion: 
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and thus, the regulatory region-/acZ fusion of pTA012 is still out of phase with respect to the LacZ reading 
55 frame. In order to bring the N-terminal coding information for the alcohol oxidas structural gene into an 
op n reading frame with the structural lacZ gene, pTA012 was treated with EcoR\-Sma\ and th r suiting 
DNA fragment ligated into pSEY101 which had similarly been treated with EcoH\ and Sma\ thus producing 
hybrid vector pTA013 (See Figure 30 and th nucleotid sequence below: 
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The vector pTA013 was then used to transform $. cerevfsfae SEY2102 for further studies described below 
55 in Exampl XV. 

V ctor pTA013 was then cleaved with Pst\ or Pst\-Nru\ and the regulatory region-/acZ fusion contained 
in the cleaved fragment ligat d with the HIS4 gene-containing fragment of shuttl vectors pTAFHI (See 
Figure 28), pYJ30(S Figur 27) or pYA2(Se Figur 23) to giv , r spectively, plasmids pSAOH 1 , pSAOH 
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5 and pSAOH 10. 



PTA013 plus 


Resulting Plasmids 


pTAFHI 

pYJ30 

pYA2 


pSAOHl 
pSAOHS 
pSAOHl 0 



B. p76 Regulatory Region Constructs 

Regulatory region - /acZ gene fusions were prepared as follows with the p76 regulatory region. 

1 . Using the Entire 5* Portion of pPG 6.0 

The 1.35 kbp Eco RI fragment of pPG 6.0 was cloned into the unique EcoRI site of pSEY101, an E. coii- 
S. cerevisiae shuttle vector, giving plasmid pT76U1 (see Figure 30a). Vector pT76U1 was then cleaved 
with Pstl-Nrul. and the larger DNA fragment ligated with the HIS4 gene-containing fragment of shuttle vector 
pTAFHI (Figure 28) as described above to give pT76H3; or with the EcoRI-end filled in Pstl-EcoRI fragment 
of shuttle vector pBPM (see Figure 34) to give pT76H4. 

2. Using a Bal31 Digest of 5'-pPG 6.0 

A 1.35 kb pair EcoRI fragment of pPG 6.0 was cloned into the unique EcoRI site of pSEY8, an E. coif-$. 
cerevisiae shuttle vector, which also has a unique Saf\ site adjacent to the EcoRI site into which the Pichia 
DNA was inserted, thus giving plasmid pTA01 (See Figure 31). The plasmid pTA01 was cleaved with Sa/I, 
treated with fla/31 exonuclease to produce blunt-ended fragments of polypeptide p76 regulatory region of 
various lengths. The blunt-ended fragments were freed from the remainder of plasmid pSEY8 by cleavage 
with EcoRI. The resulting DNA fragments were cloned into the EcoRI-Sma\ linker of pSEY101 to give, 
among others, plasmid pTAF.85 (see Figure 32). Plasmid pTAF.85 is analogous to pTA011 shown in Figure 
29, with the p76 regulatory region instead of the p72 (alcohol oxidase) regulatory region. 

Vector pTAF.85 was then treated in an analogous fashion as vector pTAO!3 to give plasmids 
pTAFH.85, pT76H1 and pT76H2. Thus, the following vectors were cleaved and ligated: 

35 



pTAF.85 plus 


Resulting Plasmid 


pTAFHI 

pYJ30 

pYA2 


pTAFH.85 

pT76H1 

pT76H2 



EXAMPLE XV 

45 Regulation Of jS-Galactosidase Production In Pichia pastorfs 

The production of £-galactosidase by several Pichia pastoris GS115 (NRRL Y-15851) transformants 
grown on different carbon sources and under different conditions was investigated. The transformed strains 
were grown in minimal medium containing 0.5 ug/mL of biotin and 0.1% glucose at 30' C until they reached 

so stationary phase. The cells were then collected by centrifugation and transferred to minimal medium 
containing 0.5 ug/mL of biotin and 0.5% methanol and grown for about 3-5 generations at 30 * C. After this 
initial growth on methanol, cells were collected by centrifugation and transferred to fresh minimal medium 
containing 0.5 ug/mL of biotin and either 0.1 % glucose or 0.3% methanol as carbon source. The cells were 
then incubated at 30 *C for about 80 hours, with samples being periodically withdrawn to determine the 

55 alcohol oxidas and tf-galactosidase I vels. After about 20-50 hours, the carbon source was depleted and 
thereafter the cells were maintained under th se carbon source starvation conditions. The results are 
summariz d in Table I. 
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Alcohol oxidase was determined by the dye-peroxidase method described above (See Example VII) and 
55 £-galactosidase was determin d as follows: 

0-Galactosidas Assay 
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A. Solutions requir d: 



Z-buffer: 



Final concentration 



5 



70 



Na2HP0* * 7H 2 0 16.1 g 

NaH 2 PCU 5.5 g 

KC1 0.75 g 

MgSO* * 7H 2 0 0.246 g 

2-mercaptoethanol 2.7 mL 



0.06 M 
0.04 M 
0.01 M 
0.001 M 
0.05 M 



fill up to 1L; pH should be 7 



75 O-Nitrophenyl-l-D-glactoside (ONPG) : 

Dissolve 400 mg ONPG (Sigma N-1127) in 100 mL of distilled water to make a 4 mg/mL ONPG 
solution. 

20 t B. Assay Procedure: 

1. Withdraw an aliquot from the culture medium (0.1-0.5 ODgoo of yeast cells), centrifuge and wash cell 
pellet with water. 

2. Add 1 mL of Z buffer to the eel! pellet, 30 uL of CHCb and 30 iiL of 0.1% SDS. vortex, incubate 5 
25 minutes at 30 * C. 

3. Start reaction by adding 0.2 mL of ONPG (4 mg/mL). vortex. 

4. Stop reaction by adding 0.5 mL of a 1 M Na2C0 3 solution at appropriate time points A* 2 o<1). 

5. Read absorbance of supernatant at 420 nm. 

30 C. Calculation of ff-galactosidase Activity Units : 

1 U = 1 nmole of orthonitrophenol (ONP) formed per minute at 30 * C and a pH 7. 
1 nmole of ONP has an absorbance at 420 nm (A* 20) of 0.0045 with a 1 cm pathlength; thus, an 
absorbance of 1 at 420 nm represents 222 nmole ONP/mL. or 378 nmole/1.7 mL since the total volume of 
35 supernatant being analyzed is 1 .7 mL. Hence, Units expressed in the Tables are calculated: 



The results presented in Table I indicate that a protein foreign to yeast, i.e., 0-gaIactosidase, can be 
produced in Pichia pastoris regulated either by the presence of methanol in the nutrient medium or by 
carbon source starvation after growth on a catabolite repressing carbon source. 

45 

EXAMPLE XVI 

Regulation Of 0-Galactosidase Production In S. cerevisiae 



50 Saccharomyces cerevisiae SEY2102, a strain requiring uracil, leucine and histidine supplementation 
for survival, was transformed with plasmids pTA013 and pT76U1. Transformed organisms were readily 
isolated by selecting for colonies which did not require uracil supplementation for growth. The isolated 
transformants, have been given the laboratory designation SEY2102-pTAO13, and SEY2102-pT76U1, 
respectively. SEY2102-pTA013 has been deposited with the Northern Regional Research Center in Peoria, 

55 Illinois to insure access to th public as of the deposit date of August 31, 1984. This strain has been 
assigned accession number NRRL Y-15858. 

Cells of NRRL Y-15858 and SEY2102-pT76U1 wer incubat d at 30 "C for about 3-4 generations in 
minimal medium containing 20 ug/mL of histidine and leucin and 5% glucos . Cells were then shifted, i. 



U = A 4 go 
t(min) 



X 378 



40 
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collected by c ntrifugation and transferred into YP medium with 3% of th carbon sourc indicated in Table 
II and grown for about 5 generations at 30 * C. Cells were then incubated for an additional 50 hours under 
carbon source starvation conditions, and periodically sampled for 0-galactosidase. The results are summa- 
rized in Table II. 

TABLE II 

Production of B-Galactosidase by S. cerevisiae 

P-galactosidase, Units /ODi;nn 

A. Alcohol oxidase regulatory region (pTA013) 
Carbon After 5 Starvation Conditions 



Source (3%) 


Generations 


6 hrs. 


20 hrs. 


50 hrs. 


glucose 


0.2 


105 


66 


nd 


fructose 


0.3 


30 


31 


28 


ethanol 


23 


137 


115 


77 


glycerol 


640 


806 


656 


788 


galactose 


982 


960 


651 


766 


B.p76 regulatory region (pT76Ul) 






Carbon 


After 5 


Starvation Conditions 


Source (5%) 


Generations 


12 hrs. 


25 hrs. 


30 hrs. 


glucose 


2-3 


254 


815 


803 


glycerol 


470 


nd 


nd 


nd 



/Ac( s U*l <ttfeltLUU£c( 

These results indicate that a protein foreign to yeast, i.e., 0-galactosidase. can be produced by 
Saccharomyces cere vis iae regulated by the p72 (alcohol oxidase) and p76 regulatory regions under 
conditions of carbon source starvation when a catabolite repressing carbon source is employed for growth, 
or by growth of transformed 5. cerevfsiae cells on a relatively non-catabolite repressing carbon source 
such as glycerol or galactose. 

The expression levels for 0-galactosidase in S. cerevisfae under the control of the regulatory regions of 
this invention can be compared to the expression levels possible with other $. cerevisfae regulated 
promoters. 



Promoter 


Carbon Source 


)S-galactosidase, Units/ODgoo 


cytochrome C-lacZ fusion (CYC1 ) 
galactose permease-lac-Z fusion (GAL2) 
Invertase-lacZ fusion"(SUC2) 


Raffinose (3%) 
Galactose (2%) 
Glucose (0.1%) 


t 460 
450 
160 



It is se n that the regulatory regions of the invention surprisingly ar at least as eff ctive as, or more 
effective as S. cerevisiae promoters than promot rs native to S. cere vis iae. 

EXAMPLE XVII 
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Southern Hybridizations With Yeast Genomic DNA 

Nine different methanol assimilating yeasts and one methanol non-assimilating yeast were grown on 
minimal media (IM1, See Example 1) plus 0.75% methanol or 1.0% glucose, respectively. Total 

s chromosomal DNA was isolated as described in Example VI, i.e., total nucleic acids were isolated, treated 
with RNase A, extracted first with phenol/chloroform/isoamyl alcohol, then with chloroform/isoamyl alcohol 
and finally ethanol precipitated. The precipitated DNA was resuspended in a minimum volume of TE buffer 
and centrifuged to clean the DNA solution. 

Southern hybridization filters were prepared as described in Example VI, i.e., 10 tig of total DNA from 

to various yeast species was digested with excess HincftW, electrophoresed, DNA denatured, the gel neutral- 
ized and DNA transferred to nitrocellulose filters. Prehybridization conditions for these filters included 
treatment with 50% deionized formamide, 6x SSC, 5x Denhardt's, 1 mM EDTA, 0.1% SDS and 100 ug/mL 
denatured salmon sperm DNA, at 42 *C overnight. The same conditions were used as a hybridization 
medium using ^P-nick-translated probes at a final concentration of 10 6 cpm/mL. The probes included the 

is cDNA inserts (Psfl fragments) from clones pPC 8.3, pPC 15.0 and pPC 6.7, as well as a 2.7 kbp BglU DNA 
fragment of the P. pastorls HIS4 gene. Each of these probes were separately used on identical filters for 
hybridization lasting 24 hours at 42* C. After hybridization the filters were washed twice for 15 minutes at 
room temperature, and three times at 65*C for 15 minutes in a solution containing 2x SSC, 1 mM EDTA, 
0.1% SDS and 0.1% sodium pyrophosphate. Other washes were tried at lower stringency, i.e., at 55 *C and 

20 42 *C, to confirm the hybridization results. The filters were then autoradiographed for 72 hours. The results 
of these hybridizations are summarized in Table 111. 
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10 



TABLE III 

Hybridization of P> pastoris Genes to Various Yeast 
Chromosomal DNA * 

P. yastoris 



1) P. pastoris 
NRRL Y-11430 



HIS4 



pPC 8.3 pPC 15.0 PPC 6.7 
+ + + 



2) P, pastoris 
NRRL Y-1603 



20 



3) Hansenula 
capsulatum 

4) tf. henricii 



(+> 



25 



5) H- nonfermentans + 



6) H. polymorpha 



(+) 



30 



7) H. wickerhamii 



35 



8) Tcrulopsis 
molischiana 



9) Saccharomyces 
cerevisiae 



45 



10) P. pastoris 
NRRL Y-15851 



*Legend: + hybridization 

(+) weak hybridization 
60 - no hybridization observed under the 

conditions employed 



55 

The results present d in Tab) til indicate that genes for polypeptides analogous to p76, p72 and p40 
are present in virtually all methanol-assimtlating y asts. It is notable that non of these three gen s wer 
observed by hybridization of DNA from a methanol non-assimilating yeast, S. cerevisiae, while homology 
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between the P. pastorls HIS4 gene and the HIS4 gen from S. cerevisiae was readily observed. 

The examples have been provided merely to illustrate the practice of the invention and should not be 
read so as to limit the scope of the invention or the appended claims in any way. Reasonable variations and 
modifications, not departing from the essence and spirit of the invention, are contemplated to be within the 
5 scope of patent protection desired and sought. 
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The following part of the description are preferred embodiments 1 to 34 presented in the format of claims. 

1 . A DNA fragment comprising a regulatory region derived from Pichia pastoris wherein said regulatory 
region is responsive to the presence of methanol in the culture medium with which a host organism for 

20 said DNA fragment is in contact, wherein said regulatory region is capable of controlling the transcription 
of messenger RNA when positioned at the 5* end of a DNA which codes for said messenger RNA, said 
regulatory region being selected from: 

(a) the regulatory region which controls the transcription of messenger RNA which codes for 
dihydroxyacetone synthetase obtainable from clone pPG6.0 (NRRL B-15867) from the 5'-Hindlll- 

25 restriction site to the 3*-Xho (-restriction site; 

(b) the regulatory region which controls the transcription of messenger RNA which codes for alcohol 
oxidase obtainable from clone pPG4.0 (NRRL B-15868) from the 5'-EcoRI-restriction site to the 3'- 
EcoRV-restriction site; 

(c) the regulatory region which controls the transcription of messenger RNA which codes for p40 
30 obtainable from clone pPG4.8 (NRRL B-15869) from the S'-BamHI-restriction site to the 3'-Sall- 

restriction site. 

2. A DNA fragment in accordance with embodiment 1 wherein said regulatory region is derived from the 
yeast species Pichia pastoris. 

35 3. A DNA fragment in accordance with embodiment 2 wherein said yeast is Pichia pastoris NRRL Y- 
11430. 

4. A DNA fragment in accordance with embodiment 3 wherein said regulatory region controls the 
transcription of messenger RNA which codes for the production of alcohol oxidase. 

5. A DNA fragment in accordance with embodiment 3 wherein said regulatory region controls the 
40 transcription of messenger RNA which codes for the production of the polypeptide p76. 

6. A DNA fragment in accordance with embodiment 3 wherein said regulatory region controls the 
transcription of messenger RNA which codes for the production of the polypeptide p40. 

7. A DNA fragment in accordance with embodiment 4 further comprising the 3' sequence of DNA 
downstream of the DNA which codes for the production of alcohol oxidase. 

45 8. A DNA fragment in accordance with embodiment 5 further comprising the 3' sequence of DNA 
downstream of the DNA which codes for the production of polypeptide P76. 

9. A DNA fragment in accordance with embodiment 6 further comprising the 3* sequence of DNA 
downstream of the DNA which codes for the production of polypeptide p40. 

10. A DNA fragment in accordance with embodiment 1 wherein said messenger RNA codes for the 
50 production of a heterologous polypeptide. 

11. A DNA fragment in accordance with embodiment 4 wherein said fragment has the nucleotide 
sequence: 
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S'-ATGCTTCCAA 
ACGTTCATGA 
GACAGGCAAT 
AACCTTTTTT 
TTGCGACTGG 
CTTTTAACGA 
ATTCGAAACG"3 ' . 



GATTCTGGTG 
TCAAAATTTA 
ATATAAACAG 
TTTATCATCA 
TTCCAATTCA 
CAACTTGAGA 



GGAATACTGC 
ACTGTTCTAA 
AAGGAAGCTG 
TTATTAGCTT 
CAAGCTTTTG 
AGATCAAAAA 



TGATAGCCTA 
CCCCTACTTG 
CCCTGTCTTA 
ACTTTCATAA 
ATTTTAACGA 
ACAACTAATT 



12. A DNA fragment in accordance with embodiment 4 wherein said fragment has the nucleotide 
sequence: 



S / "AATGGCCCAAA 


CTGACAGTTT 


AAACGCTGTC 


TTGGAACCTA 


. ATATGACAAA 


AGCGTGATCT 


CATCCAAGAT 


GAACTAACTT 


TGGTTCGTTG 


AAATCCTAAC 


GGCCAGTTGG 


TCAAAAAGAA 


ACTTCCAAAA 


GTCGGCATAC 


CGTTTGTCTT 


GTTTGGTATT 


GATTGACGAA 


TGCTCAAAAA 


TAATCTCATT 


AATGCTTAGC 


GCAGTCTCTC 


TATCGCTTCT 


GAACCCGGTG 


GCACCTGTGC 


CGAAACGCAA 


ATGGCGAAAC 


AACCCGCTTT 


TTGGATGATT 


ATGCATTGTC 


TCCACATTGT 


ATGCTTCCAA 


GATTCTGGTG 


GGAATACTGC 


TGATAGCCTA 


ACGTTCATGA 


TCAAAATTTA 


ACTGTTCTAA 


CCCCTACTTG 


GACAGGCAAT 


ATATAAACAG 


AAGGAAGCTG 


CCCTGTCTTA 


AACCTTTTTT 


TTTATCATCA 


TTATTAGCTT 


ACTTTCATAA 


TTGCGACTGG 


TTCCAATTGA 


CAAGCTTTTG 


ATTTTAACGA 


CTTTTAACGA 


CAACTTGAGA 


AGATCAAAAA 


ACAACTAATT 


ATTCGAAACG" 


3'. 



13. A DNA fragment in accordance with embodiment 4 wherein said fragment has the nucleotide 
sequence: 
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5 -AGATCTAA 


CATCCAAACA 




AATGAAACCT 


TTTTGCCATC 


CGACATCCAC 


AIjvj 1 L. c. a in c 


TC A C A C AT AA 


GTGCCAAACG 


CAACAGGAGG 


t»tiATACACTA 


GCAGCAGACG 


TTGCAAACGC 


AGGACTCATC 


u a CTTCTCTA 


ACACCATTTT 


GCATGAAAAC 


AGCCAGTTAT 


GGGCTTGATG 


GAGCTCGCTC 


ATTCCAATTC 


CTTCTATTAG 


&CTACTAACA 


CCATGACTTT 


ATTAGCCTGT 


CTATCCTGGC 


CCCC CTGGCG 


AGGTCATGTT 


TGTTTATTTC 


CGAATGCAAC 


AAGCTCCGCA 


TTACACCCGA 


ACATCACTCC 


AGATGAGGGC 


TTTCTGAGTG 


TGGGGTCAAA 


TAGTTTCATG 


TTCCCAAATG 


GCCCAAAACT 


GACAGTTTAA 


ACGCTGTCTT 


GCAACCTAAT 


ATGACAAAAG 


CGTGATCTCA 


TCCAAGATGA 


ACTAAGTTTG 


GTTCGTTGAA 


ATCCTAACGG 


CCAGTTGGTC 


AAAAAGAAAC 


TTCCAAAAGT 


CGGCATACCG 


TTTGTCTTGT 


TTGGTATTGA 


TTGACGAATG 


CTCAAAAATA 


ATCTCATTAA 


TGCTTAGCGC 


AGTCTCTCTA 


TCGCTTCTGA 


ACCCGGTGGC 


ACCTGTGCCG 


AAACGCAAAT 


GGGGAAACAA 


CCCGCTTTTT 


GGATGATTAT 


GCATTGTCCT 


CCACATTCTA 


TGCTTCCAAG 


ATTCTGGTGG 


GAATACTGCT 


GATAGCCTAA 


CGTTCATGAT 


CAAAATTTAA 


CTGTTCTAAC 


CCCTACTTGG 


ACAGGCAATA 


TATAAACAGA 


AGGAAGCTGC 


CCTGTCTTAA 


ACCTTTTTTT 


TTATCATCAT 


TATTAGCTTA 


CTTTCATAAT 


TGCGACTGGT 


TCCAATTGAC 


AAGCTTTTGA 


TTTTAACGAC 


TTTTAACGAC 


AACTTGAGAA 


GATCAAAAAA 


CAACTAATTA 


TTCGAAACG" 3 • . 





14. A DNA fragment in accordance with embodiment 5 wherein said fragment has the nuclotide 
sequence: 









5 1 -TT 


CACCCATACA 


ACTATAAACC 


TTAGCAATTG 


AAATAACCCC 


AATTCATTGT 


TCCGAGTTTA 


ATATACTTGC 


CCCTATAAGA 


AACCAAGGGA 


TTTCAGCTTC 


CTTACCCCAT 


GAACAGAATC 


TTCCATTTAC 


CCCCCACTGG 


AGAGATCCGC 


CCAAACGAAC 


AGATAATAGA 


AAAAAACAAT 


TCGGACAAAT 


ACAACACTTT 


CTCAGCCAAT 


TAAAGTCATT 


CCATGCACTC 


CCTTTAGCTG 


CCGTTCCATC 


C CTTTCTTG A 


CCAACACCAT 


CCTTACCCAG 


TACGAAAGAG 


GAAACTTAAC 


CCATACCTTG 


CAGAAATCIA 


AGGCGCGAAT 


GAGTTTAGCC 


TAGATATCCT 


TAGTGAAGGG 


TGTCCGATAC 


TTCTCCACAT 


TCACTCATAC 


ATGGGCAGCT 


TGTATCATCA 


AGAGACGGAA 


ACGCGCATAA 


GGCTAACCGC 
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CAAATTATAT 
CTTTCCTATT 
ATAAAAAGTT 
ACAAATTATA 
CAAACTATCA 



AAAGACAACA 
CTTCTATCCT 
CGTTTTAACT 
ACCCCTCTAA 
AACATC AAAA- 3 ■ 



TGCCCCAGTT 
GAGTGACCGT 
TAAGACCAAA 
ACACTAAAGT 



iAMGnrrr 

TGTGT1TAAT 
ACCAGTTACA 
TCACTCTTAT 



15. A DNA fragment in accordance with embodiment 1 further comprising: 

a polypeptide coding region; wherein said regulatory region is positioned at the 5* end of said 
polypeptide coding region. 

16. A DNA fragment in accordance with embodiment 15 wherein said polypeptide coding region codes 
for the production of alcohol oxidase. 

17. A DNA fragment in accordance with embodiment 16 wherein said fragment has the nucleotide 
sequence: 



5' -ATG GCT ATC CCC GAA GAG TTT 

3 1 -TAG CGA TAG GGG CTT CTC AAA 

GAT ATC CTA GTT CTA GGT GGT GGA TTC AGT GGA TCC 

CTA TAG GAT CAA GAT CCA CCA CCT AGG TCA CCT AGG 

TGT ATT TCC GGA AGA TTG GCA AAC TTG GAC CAC TCC 

ACA TAA AGG CCT TCT AAC CGT TTG AAC CTG GTG AGG 

TTG AAA GTT GGT CTT ATC GAA GCA GGT GAG AAC CAA 

AAC TTT CAA CCA GAA TAG CTT CGT CCA CTC TTG GTT 

CCT CAA CAA CCC ATG GGT CTA CCT TCC AGG TAT TTA 

GGA GTT GTT GGG TAC CCA GAT GGA AGG TCC ATA AAT 

CCC AAG AAA CAG AAG TTG GAC TCC AAG ACT GCT TCC 

GGG TTC TTT GTC TTC AAC CTG AGG TTC TGA CGA AGG 



TTC 


TAC 


ACT 


TCT 


AAC 


CCA 


AAG 


ATG 


TGA 


AGA 


TTG 


GGT 


AGA 


AGA 


GCC 


ATC 


GTT 


CCA 


TCT 


TCT 


CGG 


TAG 


CAA 


GGT 


GGT 


CGT 


TCT 


TCT 


ATC 


AAC 


CCA 


CCA 


AGA 


AGA 


TAG 


TTG 


GGT 


TCT 


GCT 


TCT 


GAT 


TCT 


CCA 


AGA 


CGA 


AGA 


CTA 


AGA 


GGC 


TCG 


AAA 


ACA 


GAG 


GAC 


CCG 


AGC 


TTT 


TGT 


CTC 


CTG 


AAG 


ACT 


GAG 


ACC 


TAC 


CAA 


TTC 


TGA 


CTC 


TGG 


ATG 


GTT 



TCT 


CCT 


CAC 


TTG 


AAT 


GGT 


AGA 


GGA 


GTG 


AAC 


TTA 


CCA 


TGT 


GCT 


AAC 


GTC 


TTG 


GGT 


ACA 


CGA 


TTG 


CAG 


AAC 


CCA 


TTC 


ATG 


ATG 


TAC 


ACC 


AGA 


AAG 


TAC 


TAC 


ATG 


TGG 


TCT 


GAT 


GAC 


TTN 


CAA 


GCC 


GAG 


CTA 


CTG 


AAN 


GTT 


CGG 


CTC 


TTG 


CTT 


CCA 


TTG 


ATG 


AAA 


AAC 


GAA 


GGT 


AAC 


TAC 


TTT 


AGA 


GCT 


TGN 


CAA 


CNA 


TAC 


TCT 


CGA 


ACN 


GTT 


GNT 


ATG 



CCT GAC ATT CAC GGT TTC GAA GGT CCA ATC AAG GTT 

GGA CTG TAA GTG CCA AAG CTT CCA GGT TAG TTC CAA 

TCT TTC GGT AAC TAC ACC TAC CCA GTT TGC CAC GAC 

AGA AAG CCA TTG ATG TGG ATG GGT CAA ACG GTC CTG 
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TTC 
AAC 


TTG 
AAC 


TCC 


CCT 
CGA 


TCT 

AGA 


ran 

CTC 


AGG 


CAA 

GTT 


GGT 
CCA 


ATT 
TAA 


CCA 
CGT 


TAC 
ATG 


5 


GTT 
CAA 


UnL 

CTG 


rkT 
unX 

CTA 


V. J. 1* 

GAC 


CAA 

CTT 


CAC 

CTG 


TTG 
AAC 


OTA 

CAT 


CTC 
w x w 

GAC 


ACT 
TGA 


CAC 

UAL* 

CTG 


CCT 
vox 

CCA 




x 

CGA 


a a 

CTT 


pip 
CTG 


T/V 

ACC 


TTG 
AAC 


a a n 
TTC 


1 LK* 

ACC 


TAG 


AAC 

TTG 


TCT 


CAC 

CTG 


ACT 

TGA 


TO 


UuT 
GCA 


CGT 
GCA 


TCC . 
AGG 


GAC 
CTG 


TCT 
ACA 


pp«f* 

GCT 
CGA 


CA1 
GTA 


pp a 
CGT 


TTT 
AAA 


CAC 


rap 
UAL 

GTG 


TCT 
AGA 




TCT 
AGA 


ACT 
TGA 


ATG 
TAC 


* a p a 
AGA 

TCT 


A A P 

AAC 
TTG 


CTG 


GAC 
CTG 


A A P 

TTG 


TTU 

AAC • 


A P 

TAC 
ATG 


lib 

AAC 


ATC 
TAG 


15 


TGT 
ACA 


AAC 
TTG 


ACG 
TGC 


AAG 
TTC 


GTC 
CAG 


GAC 
CTG 


AAA 
TTT 


ATT 
TAA 


ATT 
TAA 


GTC 
CAG 


GAA 
CTT 


GAC 
CTG 




CGA 
CCT 


AGA 
TCT 


GCT 
CGA 


GCT 
CGA 


GCT 
CGA 


GTT 
CAA 


AGA 
TCT 


ACC 
TGG 


GTT 
CAA 


CCA 
GGT 


AGC 
TCG 


AAG 
TTC 


20 


CCT 
GCA 


TTG 
AAC 


AAC 
TTG 


CCA 
GGT 


AAG 
TTC 


AAG 
TTC 


CCA 
GGT 


AGT 
TCA 


CAC 
GTG 


AAG 
TTC 


ATC 
TAG 


TAC 
ATC 




CGT 
GCA 


GCT 
CCA 


AGA 
TCT 


AAG 
TTC 


CAA 
GTT 


ATC 
TAG 


TTT 
AAA 


TTG 
AAC 


TCT 
AGA 


TGT 
ACA 


GGT 
CCA 


ACC 
TGG 


25 


ATC 
TAG 


TCC 
AGG 


TCT 
AGA 


CCA 
GGT 


TTG 
AAC 


GTT 
CAA 


TTG 
AAC 


CAA 
GTT 


AGA 
TCT 


TCC 
AGG 


GGT 
CCA 


TTT 
AAA 




GGT 
CCA 


GAC 
CTG 


CCA 
GGT 


ATC 
TAG 


AAG 
TTC 


TTG 
AAC 


AGA 
TCT 


GCC 
CGG 


GCT 
CGA 


GGT 
CCA 


GTT 
CAA 


AAG 
TTC 


30 


CCT 
GGA 


TTG 
AAC 


CTC 
CAG 


AAC 
TTG 


TTG 
AAC 


CCA 
GGT 


GGT 
CCA 


GTC 
CAG 


GGA 
CCT 


AGA 
TCT 


AAC 
TTG 


TTC 
AAG 


CAA 
GTT 


GAC 
CTG 


CAT 
GTA 


TAT 
ATA 


TGT 
ACA 


TTC 
AAG 


TTC 
AAG 


AGT 
TCA 


CCT 
GGA 


TAC 
ATG 


AGA 
TCT 


ATC 
TAG 




AAG 
TTC 


CCT 
GCA 


CAG 
GTC 


TAC 
ATG 


GAG 
CTC 


TCT 
AGA 


TTC 
AAG 


GAT 
CTA 


GAC 
CTG 


TTC 
AAG 


GTC 
CAG 


CGT 
GCA 


35 


GGT 
CCA 


GAT 
CTA 


GCT 
CGA 


GAG 
CTC 


ATT 
TAA 


CAA 
GTT 


AAG 
TTC 


AGA 
TCT 


GTC 
CAG 


GTT 
CAA 


GAC 
CTG 


P A ft 

CAA 
GTT 




TGG 
ACC 


TAC 
ATG 


GCC 
CGG 


AAT 
TTA 


GGT 
CCA 


ACT 
TGA 


GGT 
CCA 


CCT 
GGA 


CTT 
GAA 


GCC 
CGG 


ACT 
TGA 


MAP 

AAC 
TTG 


40 


GGT 
CCA 


ATC 
TAG 


CAA 
CTT 


GCT 
CGA 


GGT 
CCA 


GTC 
CAG 


A A 

AAG 
TTC 


ATC 
TAG 


AGA 

TCT 


CCA 
GGT 


A P A 

ACA 
TGT 


f 

pp » 
GGT 




GAA 
CTT 


GAA 
CTT 


CTC 
GAG 


TCT 
AGA 


CAA 

GTT 


ATG 
TAC 


GAC 
CTG 


GAA 
CTT 


TCC 
AGG 


TTC 
AAG 


CAG 
GTC 


GAG 
CTC 


45 


GGT 
CCA 


TAC 
ATG 


AGA 
TCT 


GAA 
CTT 


TAC 
ATG 


TTC 
AAG 


GAA 
CTT 


GAC 
CTG 


AAG 
TTC 


CCA 
GGT 


GAC 
CTG 


AAG 
TTC 




CCA 
GGT 


GTT 
CAA 


ATG 
TAC 


CAC 
GTG 


TAC 
ATG 


TCC 
AGG 


ATC 
TAG 


ATT 
TAA 


GCT 
CGA 


GGT 
CCA 


TTC 
AAG 


TTC 
AAG 


50 


GGT 
CCA 


GAC 
CTG 


CAC 
GTG 


ACC 
TGG 


AAG 
TTC 


ATT 
TAA 


CCT 
GGA 


CCT 
GGA 


CCA 
CCT 


AAG 
TTC 


TAC 
ATG 


ATG 
TAC 




ACT 
TGA 


ATG 
TAC 


TTC 
AAG 


CAC 
GTG 


TTC 
AAG 


TTG 
AAC 


GAA 
CTT 


TAC 
ATG 


CCA 
GGT 


TTC 
AAG 


TCC 
AGG 


AGA 
TCT 


55 


CGT 
CCA 


TCC 
AGG 


ATT 
TAA 


CAC 
GTG 


ATT 
TAA 


ACC 
TGG 


TCC 
AGG 


CCA 
GGT 


GAC 
CTG 


CCA 
GGT 


TAC 
ATG 


GCA 
CGT 
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GCT 
CGA 


CCA 
GGT 


GAC 
CTG 


TTC 
AAG 


GAC 
CTG 


CGA 
GCT 


GGT 
CCA 


TTC 
AAG 


ATG 
TAC 


AAC 
TTG 


GAT 
CTA 


GAA 

CTT 


AGA 
TCT 


GAC 
CTG 


ATG 
TAC 


GCT 
CGA 


CCT 
GGA 


ATG 
TAC 


GTT 
CAA 


TGG 
ACC 


GCT 
CGA 


TAC 
ATG 


AAG 
TTC 


TPT 
X v» X 

TTC 


TCT 
AGA 


AGA 
TCT 


GAA 
CTT 


ACC 
TGG 


GCT 
CGA 


AGA 
TCT 


AGA 
TCT 


AGT 
TCA 


GAC 
CTG 


CAC 
GTG 


TTT 
X A X 

AAA 


GPP 

CGG 


GGT 
CCA 


GAG 
CTC 


GTC 
CAG 


ACT 
TGA 


TCT 
AGA 


CAC 
GTG 


CAP 

GTG 


CCT 
GGA 


t "TP 
Will 

GAC 


TTC 
AAG 


pp& 

UUA 

GGT 


JLAL 

ATG 


TCA 
AGT 


TCC 
AGG 


GAG 
CTC 


GCC 
CGG 


AGA 
TCT 


GCC 
CGG 


Ttg 

X X w 

AAC 


CAA 
CTT 


ATG 
TAC 


GAT 
CTA 


TTG 
AAC 


pfcp 
uAu 

CTC 


ACC 
TGG 


TCT 
AGA 


AAT 
TTA 


GCC 
CGG 


TAC 
ATG 


GGT 
CCA 


GGA 

CCT 


CCT 
GCA 


TTG 
X X\» 

AAC 


a ap 

AAV* 

TTG 


TTG 
AAC 


TCT 
AGA 


GCT 
CGA 


GGT 
CCA 


CTT 
GAA 


GCT 
CGA 


CAC 
GTG 


GCT 
CCA 


AGA 


TGG 

ACC 


APT 

TGA 


PH & 

GTT 


pp»p 
GGA 


TTG 
AAC 


AAG 
TTC 


AAG 
TTC 


CCA 
GGT 


ACT 
nv< x 

TGA 


GCA 

CGT 


AAG 

TTC 


A AP 

TTG 


ph & 
CTT 


ppp 
CCG 


CAL 

GTG* 


p t »¥• 

GIT 

CAA 


H / **f* 

ACT 
TGA 


TCG 
AGC 


AAC 
TTG 


CAG 
GTC 


GTC 
CAG 


GAG 

CTC 


CTT 
GAA 


P&T 
wAx 

GTA 


pp n 
GGT 


pup 
UAL 

CTG 


T*P 

TAG 


pup 

CTC 


m« p 

TAC 
ATG 


GAT 
CTA 


GAG 
CTC 


GAG 
CTC 


GAT 

CTA 


GAP 

CTG 


AAG 

TTC 


ppp 
CGG 


ATT 
TAA 


pan 
bAu 

CTC 


»pp 

ALL 
TTG 


•PUP 

TAC 
ATG 


ATT 
TAA 


CGT 
CCA 


GAG 
CTC 


CAC 
GTG 


ACT 
TGA 


GAG 
CTC 


ACC 

TGG 


APA 

TGT 


ACC 


PAP 

GTG 


TPT 

ACA 


PTP 
Liu 

CCA 


p_p » 
wA 

GGT 


ACC 
TGG 


TGT 
ACA 


TCC 
AGG 


ATC 
TAG 


GGT 
CCA 


CCA 

GGT 


AGA 

TCT 


GAA 

wAA 

CTT 


GGT 

CCA 


TPP 

AGG 


AAG 

TTC 


AIL 

TAG 


GTC 
CAG 


AAA 
TTT 


TGG 
ACC 


GGT 
CCA 


GGT 
CCA 


GTT 

V*X X 

CAA 


TTG 

■Law 

AAC 


CAP 

CTG 


PAP 

GTG 


AGA 
nun 

TCT 


TPP 
AGG 


AAP 

TTG 


GTT 
CAA 


ATG 


PP Tl 

CCT 


GTC 
CAG 


AAG 
TTC 


GGC 
CCG 


TIG 
AAC 


AAG 
TTC 


GTT 
CAA 


GGT 
CCA 


GAC 
CTG 


TTC 
AAC 


TCC 
AGG 


GTG 
CAC 


TGC 
ACG 


CCA 
GGT 


GAC 
CTG 


AAT 
TTA 


GTT 
CAA 


GGT 
CCA 


TGT 
ACA 


AAC 
TTG 


ACC 
TGG 


TAC 
ATG 


ACC 
TGG 


ACC 
TGG 


GCT 
CGA 


CTT 
GAA 


TTG 
AAC 


ATC 
TAG 


GGT 
CCA 


GAA 
CTT 


AAG 
TTC 


ACT 
TGA 


GCC 
CGG 


ACT 
TGA 


TTG 
AAC 


GTT 
CAA 


GGA 
CCT 


GAA 
CTT 


CAT 
CTA 


TTA 
AAT 


GGA 
CCT 


TAC 
ATG 


TCT 
AGA 


GGT 
CCA 


GAG 
CTC 


GCC 
CGG 


TTA 
AAT 


GAC 
CTC 


ATG 
TAC 


ACT 
TGA 


GTT 
CAA 


CCT 
GGA 


CAG 
GTC 


TTC 
AAG 


AAG 
TTC 


TTG 
AAC 


GGC 
CCG 


ACT 
TGA 


TAC 
ATG 


GAG 
CTC 


AAG 
TTC 


ACC 
TGG 


GGT 
CCA 


CTT 
GAA 


GCT 
CGA 


AGA 
TCT 


TTC 
AAG 


TAA-3' 
ATT -5 f 





18. A DNA fragment in accordance with embodiment 15 wherein said polypeptide coding region codes 
for the production of polypeptide p76. 

19. A DNA fragment in accord arte with embodim nt 15 wherein said polypeptide coding region codes 
for th production of polyp ptid p40. 

20. A DNA fragment in accordanc with embodim nt 15 wherein said polypeptide coding region codes 
for the production of a het rologous polypeptide. 
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21. A DNA fragment in accordance with embodim nt 20 wherein said heterologous polypeptid is 0- 
galactosidase. 

22. A DNA fragment in accordance with embodiment 15 further comprising a 3' sequence of DNA 
downstream of the polypeptide coding region; wherein said 3* sequence of DNA is capable of controlling 
the polyadenylation, termination of transcription and termination of translation of messenger RNA coded 
for by said polypeptide coding region. 

23. A DNA fragment in accordance with embodiment 15 or 22 wherein said; DNA fragment further 
comprises one or more additional DNA sequences derived from the group consisting of 

bacterial plasmid DNA, 
bacteriophage DNA, 
yeast plasmid DNA. and 
yeast chromosomal DNA. 

24. A DNA fragment in accordance with embodiment 23 wherein said yeast chromosomal DNA 
comprises an autonomously replicating DNA sequence and a marker gene. 

25. A gene coding for the production of alcohol oxidase, or a sub-unit thereof or an equivalent of such a 
gene or sub-unit. 

26. A gene in accordance with embodiment 25 wherein said gene has the nucleotide sequence: 











5' 
3' 


-ATG 
-TAC 


GCT 
CGA 


ATC 
TAG 


CCC 
GGG 


GAA 
CTT 


GAG 
CTC 


TTT 
AAA 


GAT 
CTA 


ATC 
TAG 


CTA 
GAT 


GTT 
CAA 


CTA 
GAT 


GGT 
CCA 


GGT 
CCA 


GGA 
CCT 


TTC 
AGG 


AGT 
TCA 


GGA 
CCT 


TCC 
AGG 


TGT 
ACA 


ATT 
TAA 


TCC 
AGG 


GGA 
CCT 


AGA 
TCT 


TTG 
AAC 


GCA 
CGT 


AAC 
TTG 


TTG 
AAC 


GAC 
CTG 


CAC 
GTG 


TCC 
AGG 


TTG 
AAC 


AAA 
TTT 


GTT 
CAA 


GGT 
CCA 


CTT 
GAA 


ATC 
TAG 


GAA 
CTT 


CCA 
CGT 


GGT 
CCA 


GAG 
CTC 


AAC 
TTG 


CAA 
GTT 


CCT 
GGA 


CAA 
GTT 


CAA 
GTT 


CCC 
GGG 


ATG 
TAC 


GGT 
CCA 


CTA 
GAT 


CCT 
GGA 


TCC 
AGG 


AGG 
TCC 


TAT 
ATA 


TTA 
AAT 


CCC 
GGG 


AAG 
TTC 


AAA 
TTT 


CAG 
GTC 


AAG 
TTC 


TTG 
AAC 


GAC 
CTG 


TCC 
AGG 


AAG 
TTC 


ACT 
TGA 


GCT 
CGA 


TCC 
AGG 


TTC 
AAG 


TAC 
ATG 


ACT 
TGA 


TCT 
AGA 


AAC 
TTG 


CCA 
GGT 


TCT 
AGA 


CCT 
GGA 


CAC 
GTG 


TTG 
AAC 


AAT 
TTA 


GGT 
CCA 


AGA 
TCT 


AGA 
TCT 


GCC 
CGG 


ATC 
TAG 


GTT 
CAA 


CCA 
GGT 


TGT 
ACA 


GCT 
CGA 


AAC 
TTG 


GTC 
CAG 


TTG 
AAC 


GGT 
CCA 


GGT 
CCA 


GGT 
CCA 


TCT 
AGA 


TCT 
AGA 


ATC 
TAG 


AAC 
TTG 


TTC 
AAG 


ATG 
TAC 


ATG 
TAC 


TAC 
ATG 


ACC 
TGG 


AGA 
TCT 


GGT 
CCA 


TCT 
AGA 


GCT 
CGA 


TCT 
AGA 


GAT 
CTA 


TCT 
AGA 


GAT 
CTA 


GAC 
CTG 


TTN 
AAN 


CAA 
GTT 


GCC 
CGG 


GAG 
CTC 


GGC 
CCG 


TCG 
AGC 


AAA 
TTT 


ACA 
TGT 


GAG 
CTC 


GAC 
CTG 


TTG 
AAC 


CTT 
GAA 


CCA 
GCT 


TTG 
AAC 


ATG 
TAC 


AAA 
TTT 


AAC 
TTC 


ACT 
TGA 


GAG 
CTC 


ACC 
TGG 


TAC 
ATG 


CAA 
GTT 


AGA 
TCT 


GCT 
CGA 


TGN 
ACN 


CAA 
GTT 


CNA 
GNT 


TAC 
ATG 


CCT 
GGA 


GAC 
CTG 


ATT 
TAA 


CAC 
GTG 


GGT 
CCA 


TTC 
AAC 


GAA 
CTT 


GGT 
CCA 


CCA 
GGT 


ATC 
TAG 


AAG 
TTC 


GTT 
CAA 


TCT 
AGA 


TTC 
AAG 


GGT 
CCA 


AAC 
TTG 


TAC 
ATG 


ACC 
TGG 


TAC 
ATG 


CCA 
GGT 


GTT 
CAA 


TCC 
ACG 


CAG 
GTC 


GAC 
CTG 
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TTC 

AAG 


TTG 
AAC 


AGC 
TCC 


CCT 
CGA 


TCT 
AGA 


GAG 
CTC 


TCC 
AGG 


CAA 
GTT 


GGT 
CCA 


ATT 
TAA 


CCA 

CCT 

WW A 


TAC 

ATf* 


5 


GTT 
CAA 


GAC 
CTG 


GAT 
CTA 


CTG 
GAC 


GAA 
CTT 


GAC 
CTG 


TTG 
AAC 


GTA 
CAT 


CTG 
GAC 


ACT 
TGA 


CAC 

CTC 

\*X\* 


GGT 

CCA 




GCT 
CGA 


GAA 
CTT 


CAC 
GTG 


TGG 
ACC 


TTC 
AAC 


AAC 
TTC 


TGG 

ACC 
Awl* 


ATC 

TAC 


AAC 
TTG 


AGA 
TCT 


GAC 

CTG 


ACT 

TGA 




CGT 
GCA 


CGT 
GCA 


TCC 
AGG 


GAC 
CTG 


TCT 
AGA 


CCT 

CCA 


CAT 

CTA 


GCA 


TTT 

AAA 
AAA 


GTC 

Lau 


CAC 
GTG 


TCT 
AGA 


10 


TCT 
AGA 


ACT 
TGA 


ATG 
TAC 


AGA 
TCT 


AAC 

TTC 


. CAC 

CTC 
ui Km 


GAC 
CTG 


AAC 
TTG 


TTG 
AAC 


TAC 
ATG 


TTG 
AAC 


ATC 
TAG 




TGT 
ACA 


AAC 
TTG 


ACG 
TGC 


AAG 
TTC 


GTC 

CAC 


GAC 
CTG 


AAA 
TTT 


ATT 

TH A 

XAA 


ATT 

TAA 


GTC 
CAG 


GAA 
CTT 


GAC 
CTG 


15 


GGA 
CCT 


AGA 
TCT 


GCT 
CGA 


GCT 
CGA 


GCT 

CCA 


■GTT 

CAA 


AGA 

Tf*T 


ACC 

XWi 


GTT 
CAA 


CCA. 
GGT 


AGC 
TCG 


AAG 
TTC 




CCT 
GCA 


TTG 
AAC 


AAC 
TTG 


CCA 

CCT 

\X\JJL. 


AAG 
TTC 


AAG 
TTC 


CCA 
(Ail 


AGT 

TCA 


CAC 
GTG 


AAG 
TTC 


ATC 
TAG 


TAC 
ATG 


20 


CGT 
GCA 


GCT 
CGA 


AGA 
TCT 


AAG 

X X w 


CAA 

CTT 

WAX 


ATC 

TAf 
J. Au 


TTT 

AAA 
AAA 


TTG 
AAC 


TCT 
AGA 


TGT 
ACA 


GGT 
CCA 


ACC 
TGG 




ATC 
TAG 


TCC 
AGG 


TCT 
ACA 


CCA 
GGT 


TTG 

A Af* 


GTT 

f* A A 


TTG 

AAC 


CAA 

GTT 


AGA 
TCT 


TCC 
AGG 


GGT 
CCA 


TTT 
AAA 


25 


GGT 
CCA 


GAC 
CTG 


CCA 


ATC 

TAC 


AAG 
TTC 


TTG 

A IP 


AGA 

TCT 


GCC 
CGG 


GCT 
CGA 


GGT 
CCA 


GTT 
CAA 


AAG 
TTC 




CCT 

CCA 


TTG 

AAC 
Anv 


CTC 

CAC 


AAC 


TTG 
a &p 

AAC 


CCA 
GGT 


GGT 
CCA 


GTC 
CAG 


GGA 
CCT 


AGA 
TCT 


AAC 
TTG 


TTC 
AAG 


30 


CAA 
GTT 


GAC 
crn 

w lu 


CAT 


TAT 

ATA 


TGT 
ACA 


TTC 
AAG 


TTC 
AAG 


AGT 
TCA 


CCT^. 
GGA 


TAC 
ATG 


AGA 
TCT 


ATC 
TAG 


AAG 
TTC 


CCT 

Cf*A 


CAG 


TAC 


GAG 

tie 


TCT 
ACA 


TTC 
AAG 


GAT 
CTA 


GAC 
CTG 


TTC 
AAG 


GTC 
CAG 


CGT 
GCA 




GGT 
CCA 


GAT 
CTA 


GCT 

CCA 
ww#% 


GAG 

CTC 


ATT 

TAA 
Ann 


CAA 
GTT 


AAG 
TTC 


AGA 
TCT 


GTC 
CAG 


GTT 
CAA 


GAC 
CTG 


CAA 
CTT 


35 


TGG 
ACC 


TAC 
ATG 


GCC 

ccc 


AAT 

TTA 


GGT 

Cf*A 


ACT 

luA 


GGT 

CCA 


CCT 

GGA 


CTT 
GAA 


GCC 
CGG 


ACT 
TGA 


AAC 
TTG 




GGT 
CCA 


ATC 
TAG 


GAA 
CTT 


GCT 
CGA 


GGT 

CCA 
w w/% 


GTC 

CAC 


AAG 
TTC 


ATC 
Tar 


AGA 

Tf*T 


CCA 

GOT 


ACA 
TGT 


CCA 

GGT 


40 


GAA 
CTT 


GAA 
CTT 


CTC 


TCT 


CAA 
GTT 


ATG 
TAC 


GAC 
CTG 


GAA 
CTT 


TCC 
AGG 


TTC 
AAG 


CAG 
GTC 


GAG 
CTC 




CGT 
CCA 


TAC 
ATG 


AGA 
TCT 


GAA 
CTT 


TAC 
ATG 


TTC 
AAG 


GAA 
CTT 


GAC 
CTG 


AAG 
TTC 


CCA 
GCT 


GAC 
CTG 


AAG 
TTC 


45 


CCA 
GGT 


CTT 
CAA 


ATG 
TAC 


CAC 
GTG 


TAC 
ATG 


TCC 
AGG 


ATC 
TAG 


ATT 
TAA 


GCT 
CGA 


GGT 
CCA 


TTC 
AAG 


TTC 
AAG 




GGT 
CCA 


GAC 
CTG 


CAC 
GTG 


ACC 
TGG 


AAG 
TTC 


ATT 
TAA 


CCT 
CGA 


CCT 
GCA 


GGA 
CCT 


AAG 
TTC 


TAC 
ATG 


ATG 
TAC 


50 


ACT 
TGA 


ATG 
TAC 


TTC 
AAG 


CAC 
GTG 


TTC 
AAG 


TTG 
AAC 


GAA 
CTT 


TAC 
ATG 


CCA 
GGT 


TTC 
AAG 


TCC 
AGG 


AGA 
TCT 




CGT 
CCA 


TCC 
AGG 


ATT 
TAA 


CAC 
GTG 


ATT 
TAA 


ACC 
TGG 


TCC 
AGG 


CCA 
GGT 


GAC 
CTG 


CCA 
GGT 


TAC 
ATG 


GCA 
CGT 
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GCT 
CGA 


CCA 
GGT 


GAC 
CTG 


TTC 
AAG 


GAC 
CTG 


CGA 
GCT 


GGT 
CCA 


TTC 
AAC 


ATG 
TAC 


AAC 
TTG 


GAT 
CTA 


GAA 
CTT 


5 


AGA 
TCT 


GAC 
CTG 


ATG 
TAC 


GCT 
CGA 


CCT 
GGA 


ATG 
TAC 


GTT 
CAA 


TGG 
ACC 


GCT 
CGA 


TAC 
ATG 


AAG 
TTC 


TCT 
TTC 




TCT 
AGA 


AGA 
TCT 


GAA 
CTT 


ACC 
TGG 


GCT 
CGA 


AGA 
TCT 


ACA 
TCT 


AGT 
TCA 


GAC 
CTG 


CAC 
GTG 


TTT 
AAA 


GCC 
CGG 


10 


GGT 
CCA 


GAG 
CTC 


GTC 
CAG 


ACT 
TGA 


TCT 
AGA 


CAC 
GTC 


CAC 
GTG 


CCT 
GGA 


CTG 
GAC 


TTC 
AAG 


CCA 
GGT 


TAC 
ATG 




TCA 
AGT 


TCC 
AGG 


GAG 
CTC 


GCC 
CGG 


AGA 
TCT 


GCC 
CGG 


TTG 
AAC 


GAA 
CTT 


ATG 
TAC 


GAT 
CTA 


TTG 
AAC 


GAG 
CTC 


15 


ACC 
TGG 


TCT 
AGA 


AAT 
TTA 


GCC 
CGG 


TAC 
ATG 


GGT 
CCA 


GGA 
CCT 


CCT 
GGA 


TTG 
AAC 


AAC 
TTG 


TTG 
AAC 


TCT 
AGA 




GCT 
CGA 


GGT 
CCA 


CTT 
GAA 


GCT 
CGA 


CAC 
GTG 


GGT 
CCA 


TCT 
AGA 


TGG 
ACC 


ACT 
TGA 


CAA 
GTT 


CCT 
GGA 


TTG 
AAC 


20 


AAG 
TTC 


AAG 

TTC 


CCA 
GGT 


ACT 
TGA 


GCA 
CGT 


AAC 
TTC 


AAC 
TTG 


GAA 
CTT 


GGC 
CCG 


CAC 
GTG 


GIT 
CAA 


ACT 
TGA 




TCG 
AGC 


AAC 
TTG 


CAG 
GTC 


GTC 
CAG 


GAG 
CTC 


CTT 
GAA 


CAT 
GTA 


CCA 
CGT 


GAC 
CTG 


ATC 
TAG 


GAG 
CTC 


TAC 
ATG 


25 


GAT 
CTA 


GAG 
CTC 


GAG 
CTC 


GAT 
CTA 


GAC 
CTG 


AAG 
TTC 


GCC 
CGG 


ATT 
TAA 


GAG 
CTC 


ACC 
TTG 


TAC 
ATG 


ATT 
TAA 




CGT 
GCA 


GAG 
CTC 


CAC 
GTG 


ACT 
TGA 


GAG 
CTC 


ACC 
TGG 


ACA 
TGT 


TGG 
ACC 


CAC 
GTG 


TGT 
ACA 


CTG 
CCA 


GGA 
GGT 


30 


ACC 
TGG 


TGT 
ACA 


TCC 
AGG 


ATC 
TAG 


GGT 
CCA 


CCA 
GGT 


AGA 
TCT 


GAA 
CTT 


GGT 
CCA 


TCC 
AGG 


AAG 
TTC 


ATC 
TAG 




GTC 
CAG 


AAA 
TTT 


TGG 
ACC 


GGT 
CCA 


GGT 
CCA 


GTT 
CAA 


TIC 
AAC 


GAC 
CTG 


CAC 
GTG 


AGA 
TCT 


TCC 
AGG 


AAC 
TTG 


35 


GTT 
CAA 


TAC 
ATG 


GGA 
CCT 


GTC 
CAG 


AAG 
TTC 


GGC 
CCG 


TIG 
AAC 


AAG 
TTC 


GTT 
CAA 


GGT 
CCA 


GAC 
CTG 


TTG 
AAC 




TCC 
AGG 


GTC 
CAC 


TGC 
ACG 


CCA 
GGT 


GAC 
CTG 


AAT 
TTA 


GTT 
CAA 


GGT 
CCA 


TGT 
ACA 


AAC 
TTG 


ACC 
TGG 


TAC 
ATG 


40 


ACC 
TCG 


ACC 
TGG 


GCT 
CGA 


CTT 
GAA 


TTG 
AAC 


ATC 
TAG 


GGT 
CCA 


GAA 
CTT 


AAG 
TTC 


ACT 
TGA 


GCC 
CGG 


ACT 
TGA 




TTG 
AAC 


GTT 
CAA 


GGA 
CCT 


GAA 
CTT 


CAT 
CTA 


TTA 
AAT 


GGA 
CCT 


TAC 
ATG 


TCT 
AGA 


GGT 
CCA 


GAG 
CTC 


GCC 
CGG 


45 


TTA 
AAT 


GAC 
CTG 


ATG 
TAC 


ACT 
TGA 


GTT 
CAA 


CCT 
GGA 


CAG 
GTC 


TTC 
AAG 


AAG 
TTC 


TTG 
AAC 


GGC 
CCG 


ACT 
TGA 




TAC 
ATG 


GAG 
CTC 


AAG 
TTC 


ACC 
TGG 


GGT 
CCA 


CTT 
GAA 


GCT 
CGA 


AGA 
TCT 


TTC 
AAG 


TAA- 
ATT- 


3" 
5 1 





50 

27. A gene coding for the production of polypeptide p76, or a sub-unit thereof or an equivalent of such a 
gene or sub-unit. 

28. A gene coding for the production of polypeptide p40, or a sub-unit thereof or an equivalent of such a 
gene or sub-unit. 

55 29. A DNA fragment in accordance with embodiment 23 wherein said fragment is in the form of a closed 
circular hybrid plasm id. 

30. A transformed yeast strain; wher in said transform d yeast strain is a host for recombinant DNA 
material; 
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wherein said recombinant DNA material comprises: 

(1) a DNA fragment, and 

(2) a polypeptide coding region; wherein said DNA fragment is responsive to at least one of the 
conditions selected from the group consisting of: 

(i) the presence of methanol in the culture medium with which a host organism for said DNA 
fragment is in contact, and 

wherein said regulatory region is positioned at the 5* end of said polypeptide coding region; and, 
wherein said transformed yeast strain is capable of expressing the polypeptide coded for by said 
polypeptide coding region. 

31 . A transformed yeast strain in accordance with embodiment 30 wherein said transformed yeast strain 
is capable of growth on methanol as carbon and energy source. 

32. A transformed yeast strain in accordance with embodiment 30 wherein said recombinant DNA 
material further comprises a second DNA fragment; 

wherein said second DNA fragment is capable of controlling the polyadenylation, termination of 
transcription and termination of translation of messenger RNA when positioned at the 3' end of the 
polypeptide coding region which codes for the production of said messenger RNA. 

33. A process for preparing polypeptides comprising 

cultivating a transformed yeast strain in a nutrient medium containing methanol wherein said 
transformed yeast strain is capable of expressing an inserted polypeptide coding sequence derived from 
recombinant DNA material; wherein said recombinant DNA material comprises 

(1 ) a methanol responsive DNA fragment, and 

(2) a polypeptide coding region; wherein said methanol responsive DNA fragment is positioned at the 
5' end of said polypeptide coding region. 

34. A process in accordance with embodiment 33 further comprising; 
isolating and purifying said polypeptide. 

Claims 

Claims for the following Contracting States : BE, CH, DE, FR, GB, IT, Ll v LU, NI_ SE 

1. A DNA fragment comprising a regulatory region derived from Pichia pastoris, wherein said regulatory 
region is responsive to the presence of methanol in the culture medium with which a host organism for 
said DNA fragment is in contact, wherein said regulatory region is capable of controlling the transcrip- 
tion of messenger RNA when positioned at the 5' end of a DNA which codes for said messenger RNA, 
said regulatory region being selected from: 

(a) the regulatory region which controls the transcription of messenger RNA which codes for 
dihydroxyacetone synthetase obtainable from clone pPG6.0 (NRRL B-15867) from the 5*-Hindlll- 
restriction site to the S'-Xhol-restriction site; 

(b) the regulatory region which controls the transcription of messenger RNA which codes for alcohol 
oxidase obtainable from clone pPG4.0 (NRRL B-15868) from the S'-EcoRI-restriction site to the 3'- 
EcoRV-restriction site; 

(c) the regulatory region which controls the transcription of messenger RNA which codes for p40 
obtainable from clone pPG4.8 (NRRL B-15869) from the S'-BamHI-restriction site to the ^-Sail- 
restriction site. 

2. The DNA fragment of claim 1 further comprising a polypeptide coding region, wherein said regulatory 
region is positioned at the 5' end of said polypeptide coding region. 

3. The DNA fragment of claim 2 further comprising a 3* sequence of DNA downstream of the polypeptide 
coding region, wherein said 3" sequence of DNA is capable of controlling the polyadenylation, 
termination of transcription and termination of translation of messenger RNA coded for by said 
polypeptide coding region. 

4. The DNA fragment of claim 1 characterized in that said regulatory region is derived from Pichia 
pastoris NRRL Y-11430. 

5. The DNA fragment of claim 3 further comprising one or more additional DNA sequences derived from 
the group of bacterial plasmid DNA, bact riophage DNA, yeast plasmid DNA, and yeast chromosomal 
DNA. 
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6. The DNA fragment of claim 5 wh rein said y ast chromosomal DNA comprises an autonomously 
replicating DNA sequence and a marker gene. 

7. The DNA fragment of any of claims 2-6 characterized in that said polypeptide coding region codes for 
5 a heterologous polypeptide. 

8. The DNA fragment of any of claims 2-6 characterized in that said polypeptide coding region codes for 
alcohol oxidase. 

70 9. The DNA fragment of any of claims 2-6 characterized in that said polypeptide coding region codes for 
dihydroxyacetone synthase. 

10. The DNA fragment of any of claims 2-6 characterized in that said polypeptide coding region codes for 
polypeptide p40. 

75 

11. The DNA fragment of any of claims 2-6 characterized in that said fragment is in the form of a closed 
circular hybrid plasmid. 

12. A gene, derived from Pichia pastoris, coding for alcohol oxidase, said gene having as regulatory region 
20 a regulatory region comprised in the DNA fragment of claim 1 . 

13. A gene, derived from Pichia pastoris, coding for dihydroxyacetone synthase, said gene having as 
regulatory region a regulatory region comprised in the DNA fragment of claim 1 . 

25 14. A gene, derived from Pichia pastoris, coding for polypeptide p40, said gene having as regulatory region 
a regulatory region comprised in the DNA fragment of claim 1, 

15. A transformed yeast strain obtained by transforming a Pichia pastoris host with an expression vector 
carrying recombinant DNA material, wherein said recombinant DNA material comprises the DNA 

30 fragment of claim 3. 

16. The transformed yeast strain of claim 15, wherein said Pichia pastoris host is GS 115 (NRRL Y-15851). 

17. A process for preparing polypeptides comprising cultivating the transformed yeast strain of claim 15 or 
35 16 in a nutrient medium and inducing the expression in the presence of methanol and recovering the 

polypeptides. 

18. The process of claim 17 wherein the nutrient medium for culturing the transformed yeast strain of claim 
15 or 16 contains methanol. 

40 

Claims for the following Contracting State : AT 

1. A process for preparing polypeptides comprising cultivating a transformed yeast strain obtained by 
transforming a Pichia pastoris host with an expression vector carrying recombinant DNA material, 
45 wherein said recombinant DNA material comprises a DNA fragment comprising a regulatory region 
derived from Pichia pastoris wherein said regulatory region is responsive to the presence of methanol 
in the culture medium with which a host organism for said DNA fragment is in contact, wherein said 
regulatory region is capable of controlling the transcription of messenger RNA when positioned at the 5' 
end of a DNA which codes for said messenger RNA, said regulatory region being selected from: 
50 (a) the regulatory region which controls the transcription of messenger RNA which codes for 

dihydroxyacetone synthetase obtainable from clone pPG6.0 (NRRL B-15867) from the S'-Hindlll- 
restriction site to the 3'-Xhol-restriction site; 

(b) the regulatory region which controls the transcription of messenger RNA which codes for alcohol 
oxidase obtainable from clone pPG4.0 (NRRL B-15868) from the 5*-EcoRI-restriction site to the 3'- 

55 EcoRV-restriction site; 

(c) th regulatory region which controls th transcription of mess ng r RNA which codes for p40 
obtainable from clone pPG4.8 (NRRL B-15869) from the S'-BamHl-restriction site to the S'-Sall- 
restriction site; 
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wher in said cultivating takes place in a nutrient medium and inducing the xpr ssion in the presence 
of methanol and recovering the polypeptides. 

2. The process of claim 1 wherein said DNA fragment further comprises a polypeptide coding region, 
5 wherein said regulatory region is positioned at the 5* end of said polypeptide coding region. 

a. The process of claim 2 wherein said DNA fragment further comprises a 3* sequence of DNA 
downstream of the polypeptide coding region, wherein said 3* sequence of DNA is capable of 
controlling the polyadenylation, termination of transcription and termination of translation of messenger 
70 RNA coded for by said polypeptide coding region. 

4. The process of claim 1 characterized in that said regulatory region is derived from Pichia pastoris 
NRRL Y-11430. 

75 5. The process of claim 3 wherein said DNA fragment further comprises one or more additional DNA 
sequences derived from the group bacterial plasmid DNA, bacteriophage DNA, yeast plasmid DNA, and 
yeast chromosomal DNA. 

6. The process of claim 5 wherein said yeast chromosomal DNA comprises an autonomously replicating 
20 DNA sequence and a marker gene. 

7. The process of any of claims 2-6 characterized in that said polypeptide coding region codes for a 
heterologous polypeptide. 

25 8. The process of any of claims 2-6 characterized in that said polypeptide coding region codes for 
alcohol oxidase. 

9. The process of any of claims 2-6 characterized in that said polypeptide coding region codes for 
dihydroxyacetone synthase. 

30 

10. The process of any of claims 2-6 characterized in that said polypeptide coding region codes for 
polypeptide p40. 

11. The process of any of claims 2 6 characterized in that said fragment is in the form of a closed circular 
35 hybrid plasmid. 

12. A gene, derived from Pichia pastoris, coding for alcohol oxidase, said gene having as regulatory region 
a regulatory region comprised in the DNA fragment of claim 1 . 

40 13. A gene, derived from Pichia pastoris, coding for dihydroxyacetone synthase, said gene having as 
regulatory region a regulatory region comprised in the DNA fragment of claim 1 . 

14. A gene, derived from Pichia pastoris, coding for polypeptide p40, said gene having as regulatory region 
a regulatory region comprised in the DNA fragment of claim 1 . 

45 

15. The process of claim 1, wherein said Pichia pastoris host is GS 115 (NRRL Y-15851). 

16. The process of any of claims 1-15 wherein the nutrient medium for culturing the transformed yeast 
strain contains methanol. 

50 

Patentansprti che 

PatentansprUche fUr folgende Vertragsstaaten : BE, CH, DE, FR„ GB, IT, LI, LU, NL, SE 

1. DNA-Fragment, das eine regulatorische Region, die aus Pichia pastoris stammt. umfaflt, wobei die 
55 r gulatorische Region auf die Anwesenheit von Methanol im Kulturmedium anspricht, mit dem ein 
Wirtsorganismus fOr dies s DNA-Fragment in Kontakt ist, wobei die regulatorische Region fahig ist, die 
Transkription von Messenger RNA zu kontrollieren, wenn sie am 5' Ende einer DNA positioniert ist, die 
fUr dies Messenger RNA codiert, wobei die regulatorische R gion ausgewShlt ist aus: 
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a) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert, die fUr Dih- 
ydroxyacetonsynthase codiert, wobei die Region aus Klon pPG 6.0 (NRRL B-15867) von der 5'- 
Hindlll-Schnittstelle bis zur 3'-Xhot-Schnittstelle erhSltlich ist; 

b) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert. die fUr AJkohol- 
5 oxidase codiert. wobei die regulatorische Region aus Klon pPG 4.0 (NRRL B-15868) von der 5* 

EcoRI-Schnittstelle bis zur 3' EcoRV-Schnittstelle erhaltlich ist; 

c) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert, die fUr p40 
codiert. wobei die regulatorische Region aus Klon pPG4.8 (NRRL B-15869) von der 5'-BamHI- 
Schnittstelle bis zu der 3'-Sall-Schnittstelle erhSltlich ist. 

10 

2. DN A- Fragment nach Anspruch 1, wobei dieses weiter eine Polypeptid codierende Region umfaBt. 
wobei die regulatorische Region am 5' Ende der Polypeptid codierenden Region positioniert ist. 

3. DNA-Fragment nach Anspruch 2, wobei dieses weiter eine 3*-DNA-Sequenz stromabwarts der Polypep- 
75 tid codierenden Region umfaBt, wobei die 3* DNA-Sequenz fShig ist. die Polyadenylierurig, Termination 

der Transkription und Termination der Translation von Messenger RNA. fUr die die Polypeptid 
codierende Region codiert. zu kontrollieren. 

4. DNA-Fragment nach Anspruch 1. dadurch gekennzeichnet, daB die regulatorische Region aus Pichia 
20 pastoris NRRL Y-1 1430 stammt. 

5. DNA-Fragment nach Anspruch 3. wobei dieses weiter eine Oder mehrere zusatzliche DNA-Sequenzen 
umfaBt, wobei diese aus der Gruppe Bakterienplasmid-DNA, Bakteriophagen-DNA. Hefeplasmid-DNA 
und Hefe-chromosomale DNA stammen. 

25 

6. DNA-Fragment nach Anspruch 5, wobei die chromosomale Hefe-DNA eine autonom replizierende DNA- 
Sequenz und ein Marker-Gen umfaBt 

7. DNA-Fragment nach einem der AnsprUche 2-6, dadurch gekennzeichnet. daB die Polypeptid codieren- 
30 de Region fur ein heterologes Polypeptid codiert. 

8. DNA-Fragment nach einem der AnsprUche 2-6. dadurch gekennzeichnet. daB die Polypeptid codieren- 
de Region fUr Alkoholoxidase codiert. 

35 9. DNA-Fragment nach einem der AnsprUche 2-6, dadurch gekennzeichnet, daB die Polypeptid codieren- 
de Region fUr Dihydroxyacetonsynthase codiert. 

10. DNA-Fragment nach einem der AnsprUche 2-6, dadurch gekennzeichnet. daB die Polypeptid codieren- 
de Region fUr Polpeptid p40 codiert. 

40 

11. DNA-Fragment nach einem der AnsprUche 2-6. dadurch gekennzeichnet. daB das Fragment in der 
Form etnes geschlossenen zirkularen Hybrtdplasmids ist. 

12- Aus Pichia pastoris stammendes Gen. das fUr Alkoholoxidase codiert, wobei das Gen als regulatorische 
45 Region eine regulatorische Region hat. die in dem DNA-Fragment von Anspruch 1 enthalten ist. 

13. Aus Pichia pastoris stammendes Gen, das fUr Dihydroxyacetonsynthase codiert, wobei das Gen als 
regulatorische Region eine regulatorische Region hat. die in dem DNA-Fragment von Anspruch 1 
enthalten ist. 

50 

14. Aus Pichia pastoris stammendes Gen. das fUr Polypeptid p40 codiert, wobei das Gen als regulatorische 
Region eine regulatorische Region hat, die in dem DNA-Fragment von Anspruch 1 enthalten ist. 

15. Transformierter Hefestamm, der erhalten wird durch Transformieren eines Pichia pastoris Wirts mit 
55 inem Expressionsvektor. der rekombinant s DNA-Material enthalt, wobei das rekombinant DNA- 

Material das DNA-Fragment von Anspruch 3 umfaBt. 

16. Transformierter Hefestamm nach Anspruch 15, wob i der Pichia pastoris Wirt GS 115 (NRRL Y-1 5851) 
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ist. 

17. Verfahren zum Herstellen von Polypeptides wobei dieses das Kultivieren des transformierten Hefe- 
stamms von Anspruch 15 oder 16 in einem Wachstumsmedium und Induzieren der Expression in 
5 Anwesenheit von Methanol und Gewinnen der Polypeptide umfaflt. 

1a Verfahren von Anspruch 17, wobei das Wachstumsmedium zum Kultivieren des transformierten 
Hefestamms von Anspruch 1 5 oder 1 6 Methanol enthSIt 

io PatentansprUche fiir folgenden Vertragsstaat : AT 

1. Verfahren zum Herstellen von Polypeptides das das Kultivieren eines transformierten Hefestamms 
umfaflt, der erhalten wird durch Transformieren eines Pichia pastoris Wirts mit einem Expressionsvek- 
tor, der rekombinantes DNA-Material enthaft, wobei das rekombinante DNA-Material ein DNA-Fragment 

75 umfaflt und wobei das DNA-Fragment eine aus Pichia pastoris stammende regulatorische Region 

enthalt, wobei die regulatorische Region auf die Anwesenheit von Methanol tm Kulturmedium mit dem 
ein Wirtsorganismus fUr das DNA-Fragment in Kontakt ist, anspricht, wobei die regulatorische Region 
fahig ist, die Transkription von Messenger RNA zu kontroHieren, wenn sie am 5* Ende der DNA 
positioniert ist. die fUr die Messenger RNA codiert, wobei die regulatorische Region ausgewahlt ist aus: 
20 a) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert, die fUr Dih- 

ydroxyacetonsynthase codiert, wobei die Region aus Klon pPG 6.0 (NRRL B-15867) von der 5* 
Hindlll-Schnittstelle bis zur 3' Xhol-Schnittstelle erhSlttich ist; 

b) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert, die fur Alkohol- 
oxidase codiert. wobei die regulatorische Region aus Klon pPG 4.0 (NRRL B-15868) von der 5* 

25 EcoRI-Schnittstelle bis zur 3* EcoRV-Schnittstelle erhSltlich ist; 

c) die regulatorische Region, die die Transkription von Messenger RNA kontrolliert, die fOr p40 
codiert, wobei die regulatorische Region aus Klon pPG4.8 (NRRL B-15869) von der 5' BamHI- 
Schnittstelle bis zu der 3' Sall-Schnittstelle erhaltlich ist, 

wobei das Kultivieren in einem Wachstums-Medium stattfindet, und Induzieren der Expression in 
30 Anwesenheit von Methanol und Gewinnen der Polypeptide. 

2. Verfahren nach Anspruch 1, wobei das DNA-Fragment weiter eine Polypeptid codierende Region 
umfafit, wobei die regulatorische Region am 5* Ende der Polypeptid codierenden Region positioniert ist. 

35 3. Verfahren nach Anspruch 2, wobei das DNA-Fragment weiter die 3' DNA-Sequenz stromabwarts der 
Polypeptid codierenden Region umfa£t, wobei die 3* DNA-Sequenz fahig ist, die Polyadenylation, 
Termination der Transkription und Termination der Translation von Messenger RNA, fOr die die 
Polypeptid codierende Region codiert, zu kontroHieren. 

40 4. Verfahren nach Anspruch 1 , dadurch gekennzeichnet, dafl die regulatorische Region aus Pichia pastoris 
NRRL Y-11430 stammt. 

5. Verfahren nach Anspruch 3. wobei das DNA-Fragment weiter eine oder mehrere zusatzliche DNA- 
Sequenzen umfaflt, die aus der Gruppe Bakterienplasmid-DNA, Bakteriophagen-DNA, Hefeplasmid- 

45 DNA und chromosomale Hefe-DNA stammen. 

6. Verfahren nach Anspruch 5, wobei die chromosomale Hefe-DNA eine autonom replizierende DNA- 
Sequenz und ein Marker-Gen enthMlt. 

so 7. Verfahren nach einem der AnsprUche 2-6. dadurch gekennzeichnet, dafl die Polypeptid codierende 
Region fUr ein heterologes Polypeptid codiert. 

6. Verfahren nach einem der AnsprOche 2-6, dadurch gekennzeichnet, dafl die Polypeptid codierende 
Region fUr Alkoholoxidase codiert. 

55 

9. Verfahren nach einem der AnsprOche 2-6, dadurch g kennzeichnet, daB die Polypeptid codier nde 
Region fUr Dihydroxyacetonsynthas codiert. 
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10. V rfahren nach einem der AnsprUch 2-6, dadurch gekennzeichnet, dafl die Polypeptid codierende 
Region fUr Polypeptid p40 codiert. 

11. Verfahren nach einem der AnsprUche 2-6, dadurch gekennzeichnet, daJ3 das Fragment in Form eines 
geschlossenen zirkularen Hybridplasmids ist. 

12. Aus Pichia pastoris stammendes Gen, das fur Alkoholoxidase codiert, wobei das Gen als regutatorische 
Region eine regulatorische Region hat, die in dem DNA-Fragment von Anspruch 1 enthalten ist. 

13. Aus Pichia pastoris stammendes Gen, das fOr Di hydroxy acetonsynthase codiert, wobei das Gen als 
regulatorische Region eine regulatorische Region hat. die in dem DNA-Fragment von Anspruch 1 
enthalten ist. 

14. Aus Pichia pastoris stammendes Gen, das fOr Polypeptid p40 codiert, wobei das Gen als regulatorische 
Region eine regulatorische Region hat, die in dem DNA-Fragment von Anspruch 1 enthalten ist. 

15. Verfahren nach Anspruch 1, wobei der Pichia pastoris Wirt GS 115 (NRRL Y-15851) ist. 

16. Verfahren nach einem der AnsprUche 1-15, wobei das Wachstums-Medium zum Kultivieren des 
transformierten Hefestammes. Methanol enthalt. 

Revendications 

Revendi cations pour les Etats contractants sulvants : BE, CH, DE, FR, GB, IT, U, LU, NL, SE 

1. Fragment d'ADN comprenant une region regulatrice obtenu a partir de Pichia pastoris, dans lequel 
cette region regulatrice repond a la presence de methanol dans le milieu de culture avec lequel un 
organisme h6te pour ce fragment d'ADN est en contact, dans lequel cette region regulatrice est 
capable de commander la transcription de TARN messager lorsqu'il est dispose a I 'extremity 5' d'un 
ADN qui code pour cet ARN messager, cette region regulatrice etant choisie parmi : 

(a) la region regulatrice qui commande la transcription de TARN messager qui code pour la 
dihydroxyac£tone synthetase pouvant §tre obtenue a partir d'un clone pPG6.0 (NRRL B- 15867) du 
site de restriction 5'-Hindlll au site de restriction 3'-Xhol; 

(b) la region regulatrice qui commande la transcription de TARN messager qui code pour Palcool 
oxydase pouvant etre obtenu a partir du clone pPG4.0 (NRRL B-15868) du site de restriction 5'- 
EcoRI au site de restriction 3*-EcoRV; 

(c) la region regulatrice qui commande la transcription de TARN messager qui code pour le p40 
pouvant §tre obtenu a partir du clone pPG4.8 (NRRL B-15869) du site de restriction 5'-BamHI au 
site de restriction 3'-Sall 

2. Fragment d'ADN selon la revendication 1 , comprenant en outre une region codant pour un polypeptide, 
dans lequel cette region regulatrice est placee a I'extremite 5' de cette region codant pour un 
polypeptide. 

3. Fragment d'ADN selon la revendication 2, comprenant en outre une sequence 3* d'ADN en aval de la 
region de codage du polypeptide, dans lequel cette sequence, 3' d'ADN est capable de commander la 
polyadenylation, la terminaison de la transcription et la terminaison de la traduction de TARN messager 
pour laquelle code cette region de codage du polypeptide. 

4. Fragment d'ADN selon la revendication 1, caracterise" en ce que cette region regulatrice provient de 
Pichia pastoris NRRL Y-1 1 430. 

5. Fragment d'ADN selon la revendication 3, comprenant en outre une ou plusieurs sequences d'ADN 
suppiementaires obtenues h partir d'ADN de plasmide bacterien, d'ADN de bacte>iophase, d'ADN de 
plasm ide de levure et d'ADN chromosomique de levure. 

6. Fragment d'ADN selon la revendication 5, dans lequ I cet ADN chromosomiqu de levure comprend 
une sequence d'ADN se repliquant d'une manier autonome t un gene marqueur. 



71 



EP 0 183 071 B1 



7. Fragment d'ADN selon Tune quelconque des revendications 2 a 6, caracterise" en ce que cett region 
de codage du polypeptide code pour un polypeptide necrologue. 

a Fragment d'ADN selon Tune quelconque des revendications 2 a 6, caractense" en ce que cette region 
5 de codage du polypeptide code pour I'alcool oxydase. 

9, Fragment d'ADN selon Tune quelconque des revendications 2 a 6, caracterise en ce que cette region 
de codage du polypeptide code pour la dihydroxyacetone synthase. 

io 10- Fragment d'ADN selon Tune quelconque des revendications 2 a 6, caracterise' en ce que cette region 
de codage du polypeptide code pour le polypeptide p40. 

11. Fragment d'ADN selon Tune quelconque des revendications 2 a 6, caracterise en ce que ce fragment 
est sous la forme d'un plasmide hybride circulate term 6. 

75 

12. Gene obtenu a partir de Pichia pastoris, codant pour I'alcool oxydase, ce gene ayant comme region 
regulatrice une region regulatrice comprise dans le fragment d'ADN de la revendication 1 . 

13- Gene obtenu a partir de Pichia pastoris, codant pour la dihydroxyacetone synthase, ce gene ayant 
20 comme region regulatrice une region regulatrice comprise dans le fragment d'ADN de la revendication 
1- 

14. Gene obtenu a partir de Pichia pastoris, codant pour le polypeptide p40, ce gene ayant comme region 
regulatrice une region regulatrice comprise dans le fragment d'ADN de la revendication 1 . 

25 

15. Souche de levure transformee en transformant Pichia pastoris comme h6te avec un vecteur d 'expres- 
sion portant une matiere d'ADN recombinant, dans lequel cette matiere d'ADN recombinant comprend 
le fragment d'ADN de la revendication 3. 

30 16. Souche de levure transformee de la revendication 15, dans lequel cet h6te Pichia pastoris est GS 115 
(NRRL Y-1 5851 ). 

17. Precede de preparation de polypeptides comprenant le fait de cultiver la souche de levure transformee 
des revendications 15 ou 16 dans un milieu nutritif et d'induire ('expression en presence de methanol. 

35 et de recuperer les polypeptides. 

18. Proc£d£ selon la revendication 17, dans lequel le milieu nutritif pour cultiver les souches de levure 
transformers des revendications 1 5 ou 16 contient du methanol. 

40 Revendications pour I'Etat contractant sulvant : AT 

1. Procede de preparation de polypeptides comprenant la culture d'une souche de levure transformee 
obtenue en transformant Pichia pastoris comme hote avec un vecteur d'expression portant une matiere 
d'ADN recombinant dans lequel cette matiere d'ADN recombinant comprend un fragment d'ADN 
45 comprenant une region regulatrice obtenue a partir de Pichia pastoris dans laquelle cette region 
regulatrice repond a la presence de methanol dans le milieu de culture avec lequel ce fragment d'ADN 
est en contact, dans laquelle cette region regulatrice est capable de commander la transcription de 
TARN messager lorsqu'elle est disposee a I'extremite 5' d'un ADN qui code pour cet ARN messager, 
cette region regulatrice etant choisie parmi : 
so (a) la region regulatrice qui commande la transcription de TARN messager qui code pour la 

dihydroxyacetone synthetase pouvant etre obtenue a partir d'un clone pPG6.0 (NRRL B-15867) du 
site de restriction 5*-HindllI au site de restriction 3'-Xhol; 

(b) la region regulatrice qui commande la transcription de TARN messager qui code pour I'alcool 
oxydase pouvant etre obtenu a partir du clone pPG4.0 (NRRL B-15868) du site de restriction 5'- 

55 EcoRI au site de restriction 3'-EcoRV; 

(c) la region regulatrice qui command la transcription de TARN messager qui code pour le p40 
pouvant etre obtenu a partir du clone pPG4.8 (NRRL B-15869) du site de restriction S'-BamHI au 
sit de restriction 3'-Sall; 
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dans lequel cett culture s'effectue dans un milieu nutritif et induit 1'expression en presenc de 
methanol et r^cupere les polypeptides. 

2. Procede* selon la revendication 1, dans lequel ce fragment d'ADN comprend en outre une region de 
5 codage de polypeptide, dans lequel cette region rSgulatrice est dispose© a Pextrgmite* 5* de cette 

region de codage de polypeptide. 

3. Proc£d6 selon la revendication 2, dans lequel ce fragment d'ADN comprend en outre une sequence 3 f 
d'ADN en aval de la region de codage du polypeptide, dans lequel cette sequence 3' d'ADN est 

to capable de commander la polyad6nylation, la terminaison de la transcription et la terminaison de la 
traduction de TARN messager pour laquelte code cette region de codage du polypeptide. 

4. Procede* selon la revendication 1, caracte>ise* en ce que cette region r£gulatrice provient de Pichia 
pastoris NRRL Y-11430. 

75 

5. Precede* selon la revendication 3, dans lequel ce fragment d'ADN comprend une ou plusieurs 
sequences d'ADN supplSmentaires obtenues a partir d'ADN de plasmide bacterien. d'ADN de bacterio- 
phage, d'ADN de plasmide de levure et d'ADN chromosomique de levure. 

20 6. Proce'de* selon la revendication 5, dans lequel cet ADN chromosomique de levure comprend une 
sequence d'ADN se repliquant d'une maniere autonome et un gene marqueur. 

7. Procede* selon I'une quelconque des revendications 2 a 6, caracteVise* en ce que cette region de 
codage du polypeptide code pour un polypeptide necrologue. 

25 

8. Procede* selon I'une quelconque des revendications 2 a 6, caractSrise* en ce que cette region de 
codage du polypeptide code pour I'alcool oxydase. 

9. Procede" selon I'une quelconque des revendications 2 a 6, caract^rise en ce que cette region de 
30 codage du polypeptide code pour la dihydroxyac£tone synthase. 

10. Procede* selon I'une quelconque des revendications 2 a 6, caracte>ise* en ce que cette region de 
codage du polypeptide code pour le polypeptide p40. 

35 11. Proce'de* selon I'une quelconque des revendications 2 a 6. caracte>is6 en ce que ce fragment est sous 
la forme d'un plasmide hy bride circulate ferme\ 

12. Gene obtenu a partir de Pichia pastoris, codant pour I'alcool oxydase, ce gene ayant com me region 
rSgulatrice une region r^gufatrice comprise dans le fragment d'ADN de la revendication 1. 

40 

13. Gene obtenu a partir de Pichia pastoris, codant pour la dihydroxyac&one synthase, ce gene ayant 
comme region rggulatrice une region rSgulatrice comprise dans le fragment d'ADN de la revendication 
1. 

45 14. Gene obtenu a partir de Pichia pastoris, codant pour le polypeptide p40, ce gene ayant comme region 
rSgulatrice une region r£gulatrice comprise dans le fragment d'ADN de la revendication 1. 

15. Procede* de la revendication 1, dans lequel cet h6te Pichia pastoris est GS 115 (NRRL Y-15851). 

so 16. Proceed selon I'une quelconque des revendications 1 a 15, dans lequel le milieu nutritif pour cultiver la 
souche de levure transformed contient du methanol. 
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